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Abstract. We develop a stochastic target representation for Ricci flow and normalized Ricci flow on smooth, 
compact surfaces, analogous to Soner and Touzi's representation of mean curvature flow. We prove a verifica- 
tion/uniqueness theorem, and then consider geometric consequences of this stochastic representation. 

Based on this stochastic approach we give a proof that, for surfaces of non-positive Euler characteristic, the 
normalized Ricci flow converges to a constant curvature metric exponentially quickly in every C'^-norm. In the 
case of C'^ and -convergence, we achieve this by coupling two particles. To get -convergence (in particular, 
convergence of the curvature), we use a coupling of three particles. This triple coupling is developed here only for 
the case of constant curvature metrics on surfaces, though we suspect that some variants of this idea are applicable 
in other cases too. At any rate, this triple coupling provides a purely probabilistic approach to getting second- 
order derivative estimates for second-order PDEs. Finally, for A: > 3, the C*" -convergence follows relatively easily 
using induction and coupling of two particles. 



1. Introduction 

In [23], Soner and Touzi give a characterization of various extrinsic geometric flows (with ambient space 
K"), including mean curvature flow, as stochastic target problems. More specifically, they introduce the 
relevant target problems and then prove associated verification theorems, namely theorems showing that 
if the curvature flow has a smooth solution for an interval of time t £ [0, T), then the solution agrees with 
the solution to the stochastic target problem on this interval. In the first part of this paper, we develop 
a similar characterization of Ricci flow (and normalized Ricci flow) on compact surfaces, including the 
relevant verification theorems (see Theorem 3). We then briefly brief discuss time-dependent bounds on 
the solution to both normalized and un-normalized Ricci flow and estimates on the blow-ups of solutions 
to Ricci flow in the cases of non-zero Euler characteristic, all obtained from the stochastic formulation of 
the flow. In the remainder of the paper, we use this stochastic representation to prove that, for a smooth, 
compact surface of non-positive Euler characteristic, given that a smooth solution to the normalized Ricci 
flow exists for all time (which is well-known from the literature), it converges to a constant curvature metric 
exponentially fast in C°° (see Theorem 22 for a precise statement). 

Ricci flow on smooth, compact surfaces is essentially completely understood, so none of the geometric 
results in this paper are new. Nonetheless, one feature of our approach is that probability often provides 
an appealing intuition, as in the case of Brownian motion and heat flow. Thus, if Ricci flow is thought of 
as a kind of "heat equation for curvature," it is natural to want to extend the analogy to include a diffusion 
interpretation. For example, it's nice to see the convergence of a manifold under normalized Ricci flow 
to a constant curvature limit as the equidistribution of the metric, and as a result of the curvature, from a 
probabilistic perspective. 

More generally, one might ask about the potential merits of developing stochastic techniques for Ricci 
flow (or other curvature flows). One obvious point to be made here is that one gets a representation of the 
solution and, at least in the theory of linear second order PDEs, this has turned out to be extremely versatile 
in extracting properties of the solutions. As we will see, the stochastic tools we employ are good enough to 
give a different proof of a main result in the theory of Ricci flow on surfaces with the bonus that we see the 
"averaging property of the curvature" as a consequence of coupling, which is a probabilistic manifestation 
of ergodicity. Another motivation for such an endeavor is that the stochastic target formulation is fairly 
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insensitive to regularity, and thus potentially useful in formulating notions of weak solutions. Indeed, in a 
second paper, Soner and Touzi [ ] show that generalized solutions to various extrinsic curvature flows can 
also be understood in terms of stochastic target problems. Also stemming from these ideas, we note that 
stochastic approaches to PDEs can lend themselves to the development of probabilistic numerical schemes 
(as in [ ]), but we do not touch this subject here. 

Our framework is not the most general one. We presumably could have worked in a little more gener- 
ality, but to keep the ideas as appealing and clear as possible, we decided to study surfaces, which are the 
traditional starting point for studying Ricci flow. 

We point out that, as noted in [ ], stochastic target problems of certain kind are equivalent to second- 
order backward stochastic differential equations. As discussed there, second-order backward SDEs are 
natural stochastic objects to associate with fully non-linear PDEs. Thus, one could presumably recast the 
results of this paper in those terms. Nonetheless, we have chosen to adopt the stochastic target approach 
because it seems more geometrically intuitive and visually appealing, and because it puts Ricci flow and 
mean curvature flow in a similar framework. 

There are few papers on stochastic analysis and Ricci flow, for instance [19, 18, 2, 17, 9, 1]. The ones 
that are somewhat close to our work are [ ] and [ ] . These papers investigate the Brownian motion (and 
the associated parallel transport) with respect to a time changing metric on a manifold of any dimension, 
not only on surfaces. Using stochastic analysis they also develop a Bismut-like formula to represent the 
gradient of solutions to heat-type flows with respect to the time-dependent metric. In particular, this leads 
to gradient estimates for the corresponding solutions. 

At any rate, though the Bismut formula is very useful for gradient estimates, we do not know how to get 
a nice and useful version of this formula for second -order derivatives. This is one of the reasons we prefer 
to deal with an alternative probabilistic tool, namely coupling. This idea for dealing with the second- 
order derivatives comes from [ ], where a coupling of three particles is used to estimate second-order 
derivatives of harmonic functions on Euclidean domains. This triple coupling indicated by Cranston uses a 
certain symmetry to get a key cancellation in the estimation of the Hessian. This symmetry is not surprising 
in the flat case. However, there are obvious technical challenges for a similar construction on manifolds, 
and the way it works in the flat case does not seem to work on arbitrary manifolds. Nevertheless, it turns 
out that we can construct such a triple coupling which has enough good properties in the case of surfaces 
of constant curvature. 

We continue with a few more observations about the present work. We do not prove the existence of 
solutions to the target problem directly; rather, the verification theorems proceed from the assumption that 
the Ricci flow admits a smooth solution. In the case of normalized Ricci flow, we have long-time existence 
as proved in [3] and [14]. However, an immediate consequence of such a verification theorem is that the 
solution (to the flow) is unique. 

In contrast to the standard proof of the convergence to constant curvature, we deal directly with the 
metric itself (and its derivatives), rather than introducing an auxiliary PDE satisfied by the curvature. We 
use uniformization to work with an underlying metric which has constant curvature and is in the same 
conformal class as the initial metric. One might hope to extend these arguments to more general situations, 
but for a first paper on this approach uniformization makes the analysis cleaner and reveals the power of 
the coupling in a nice way. 

The outline of the paper is as follows. We first describe the stochastic target problem in Section 2 giving a 
fair amount of detail, since it is a somewhat non-standard control problem. Then, in Section 3 we prove the 
verification/ uniqueness theorem, namely that, if there is a smooth solution to the Ricci flow (or normalized 
Ricci flow) on some interval of time, then it agrees with the solution to the stochastic target problem. 

Section 4 is a short section showing how one can use the representation to prove that the unnormalized 
Ricci flow develops singularities (in certain cases) either in finite time or in infinite time. In Section 5, we 
develop the a priori bounds for the stochastic target problem. As a consequence, we obtain the exponential 
convergence in the C°-norm of the normalized flow in the case of x(A/) < (as usual, x(A/) denotes the 
Euler characteristic of M). We also include a short discussion of the blow up of the unnormalized Ricci flow 
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in the cases x{M) > and x{M) < 0, which is in tune with the previous section's findings, although this 
time assuming uniformization. 

Section 6 introduces and proves the main result on mirror coupling for the time changed Brownian mo- 
tions associated to the target problems. This coupling makes sense for short times, but the main challenge 
is to show that the coupling extends beyond the cut locus. This is done using the geometric structure of the 
cut locus on surfaces of Euler characteristic less than or equal to 0. We should also point out that there is a 
coupling of Brownian motions constructed with respect to time-varying metrics (such as Ricci flow) in [ ! ], 
but it differs from our situation here. 

In Section 7 we start the main analysis of the convergence of normalized Ricci flow. We prove the non- 
trivial fact that in Euler characteristic zero, the normalized flow converges exponentially fast in the C°- 
topology. This uses the result from the previous section combined with the comparison of the distance 
process with a Bessel process in order to estimate the coupling time, which is a fundamentally probabilistic 
idea. Combining this result with those coming from the a priori estimates proves that, for non-positive 
Euler characteristic, the flow convergence in the C" -topology exponentially fast. 

The next task is to prove that the convergence takes place also in C^, or in other words that the gradient 
of the metric converges exponentially fast. This is done in Section 8, again using coupling. However, the 
point here is a little different. We use the coupling for particles started close to one another and estimate the 
coupling time in terms of the gradient of the metric and the initial distance. This in turn yields a functional 
inequality satisfied by the C°-norm of the gradient which is contained in Lemma 12. It turns out that this 
functional inequality is strong enough to produce the exponential convergence. 

Going forward. Section 9 is dedicated to the triple coupling used in a crucial way for the Hessian es- 
timates. We exploit in an essential way the constant curvature properties of the underlying metric. We 
have two mirror coupled particles x and y and another middle particle z which is moving on the geodesic 
between them which is described by the distance pi from z to x, or alternatively, the distance p2 from z to 
y. One of the main interests is the symmetry with respect to swapping pi and p2- The other thing thrust 
of the investigation is as follows. Assuming that x and y are time changed Brownian motions, we study 
the conditions under which z is a time changed Brownian motion with a drift. This is a key point in the 
Hessian estimates. 

Section 10 covers the Hessian estimates. Here we use the results from the previous sections, for instance, 
the exponential decay of the flow in the C^-topology and the triple coupling. As in the case of the gradient 
we end up with a functional inequality for the C^-norm of the Hessian as in Lemma 21. It turns out that 
this suffices to conclude the exponential convergence. 

The last section proves the C'' -convergence of the flow. This is done essentially using the Ricci flow 
equation and induction. It is important to mention here that in the flat case, we still have to use the coupling. 

A few words about the sphere case, which definitely requires some finer analysis. There are several 
obstacles we have to overcome. On one hand, the a priori estimates give bounds which blow up in finite 
or infinite time. However, these estimates are simply bounds of a stochastic differential equation in terms 
of the ODE in which the martingale is killed off, and eventually can likely be refined. Further, in the 
case of non-positive Euler characteristic, there is a unique stationary solution to the normalized Ricci flow 
with a given volume (in a given conformal class), and thus one has to prove that the flow converges to 
this uniquely determined solution. In the case of the sphere this is not the case and thus convergence is 
harder to establish, because we do not know beforehand toward which stationary solution the flow wants 
to converge (this is related to the issue of Ricci solitons). Therefore, the strategy we used in this paper for 
x{M) < needs some refinements if it's to address the case of positive Euler characteristic. 



2. Stochastic target formulation 

2.1. Ricci flow. Consider a smooth, compact Riemannian surface (M, h), that is, M is a smooth, compact 
manifold of dimension two and h a smooth Riemannian metric on M. Any other smooth metric in the same 
conformal class as h can be written as g = uh for some smooth, positive function u. The Ricci curvature of 
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any metric metric g is given by 

(1) 2RiCg = Rg = 2Kgg, 

where Rg is the scalar curvature and Kg is the Gauss curvature. The Ricciflow is defined as the evolution of 
the metric gt according to 

(2) dtgij = -2Ricij 

where Ric is the Ricci tensor. From this, it is easy to see that the Ricci flow preserves the conformal class in 
two dimensions, and thus it becomes an evolution equation for the conformal factor ut . In particular, the 
Ricci flow corresponds to u evolving by 

(3) dtut = Ah log Ut - 2Kh 

where Kh is the Gauss curvature of (M, h). In passing from (2) to (3), we have already used the fact that if 
g — uh, for two metrics, g and h, then (see [ , Exercise 2.8]) 

(4) Rg^^{Rh- Ahlogu) 

where the A/i is the Laplacian with respect to the metric h. 

This is a non-linear parabolic equation, and thus the usual probabilistic methods of solution (diffusions, 
Feyrtman-Kac, etc.) don't apply. Instead, we will adopt a stochastic target approach modeled on the ap- 
proach of [ ] to mean curvature flow, as mentioned above. 

To be more concrete, we assume that the initial metric on M can be written as go = uoh for some smooth, 
positive M and some metric h. There are two natural choices for h. Of course, we can let h = go and uq = 1. 
Alternatively, the uniformization theorem implies that there is a metric in the same conformal class as go 
which has constant curvature of —1, 0, or 1. Then we can take h to be this metric, in which case uq is 
determined by the condition that gQ — uoh. We will find the flexibility of this set-up to be useful. 

As usual, we also wish to introduce the normalized Ricci flozv, which is defined as 

(5) dtgij = -2RiCij + 2rgij 

where r is the average of the Gauss curvature on AI with respect to the metric g. Written in terms of the 
conformal factor, this is 

dtUt = Aft log u - 2Kh + 2rtUt. 
Under this flow, the surface is continually rescaled to preserve the area. Indeed, the Gauss-Bonnet The- 
orem tells us that the integral of the scalar curvature is 



j KgdAg = 27rx(M) 



where x(M) is the Euler characteristic of M and Ag is the area element of the metric g. Consequently, if rt 
is the average of the Gauss curvature for gt, then 

27rx(M) 
* area(M,5t) 

where area(Af, g) stands for the area of M with the metric g. From here, a straightforward calculation gives 
that 

dt aiea.{M , gt) ^ dt utdAh = / dtUtdAh = -2 / KhdAh + 2rt / utdAh = 0, 



which shows that the area is preserved under this evolution and, in particular, rt does not depend on t. 
Therefore the flow (5) preserves the area and 

(6) '-^^V 

area(M, go) 

We can now translate (5) into an equation satisfied by the conformal change ut as (recall that gt = uth) 

(7) dtUt = A/i log u - 2Kh + 2rut 
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with r the constant from (6). 

As is implicit in the above, we see that the set of all smooth metrics (on M) in a given conformal class 
corresponds to the set of smooth sections of a one-dimensional bundle over AI. More concretely, fixing a 
"reference metric" h and writing any other (smooth) metric (in the same conformal class) as uh induces a 
global coordinate u on fibers of this bundle making the total space E diffeomorphic to AI x (0, oo). Further, 
u is given as the composition of the lift from M to E (corresponding to the section) with u. This helps to 
explain the notation: u is a coordinate on the fibers, and u is the expression of a section in this coordinate. 
Because our bundle admits natural global coordinates, we will almost always work in these coordinates, 
and thus we won't have much occasion to consider sections in a coordinate-free notation. 

Viewed in this light, it is natural to introduce a new coordinate on the fibers. Let p = (1/2) logu. Then 
any other metric in the same conformal class as h can be written as g = e'^^h for some smooth function 
p : M — > M, which is given by the composition of the lift M — > E (corresponding to the section) with p. 
This coordinate makes the bundle into a real line bundle. In particular, the metric h corresponds to the 
zero section, and fiberwise addition corresponds to composition of conformal changes. However, we won't 
need the vector space structure on fibers in what follows; we really just view the fibers as having a smooth 
structure. In terms of the coordinate p, the Ricci flow equation becomes 

(8) dtPt = {A^p, - Kh) , 

and the normalized Ricci flow equation becomes (see also [20, Equation 1.3.1]) 

(9) dtP, = e-^P^{AnPt~Kn)+r. 

with r the constant defined in (6) and thus depending only on the are of M with respect to the initial metric 
50- 

At this point, we see that there is a one-to-one correspondence between metrics in the same conformal 
class as h, sections of E over M, and functions p (where all of these objects are assumed to be smooth). 
Further, there is a one-to-one correspondence between smooth sections and smooth hypersurfaces of E that 
intersect each fiber once and do so transversely; under composition with p this is the same as the corre- 
spondence between smooth functions on M and their graphs in M x M. Viewing metrics as h5^ersurfaces 
in the total space E provides a framework for studying Ricci flow which is fairly similar to that of mean 
curvature flow and well-suited for the stochastic target approach. Our next task is to define the appropriate 
target problem. 

2.2. The target problem. Let r(0) be the hypersurface corresponding to the initial metric g^. In spite of our 
previous efforts to distinguish between sections over M from their description in a particular coordinate, in 
what follows we will fix the global coordinate p on fibers, thus identifying the fibers with M, and formulate 
everything in those terms. In particular, r(0) corresponds to the graph of pg. The stochastic target problem 
is, for any time t, the problem of determining the set of points such that the controlled process, starting 
from such a point, can be made to hit r(0) (the "target") in time t almost surely. Obviously, this requires 
specifying the allowed controls and the processes they give rise to. We will generally explain things for the 
Ricci flow and then indicate the analogous results for the normalized Ricci flow in situations where there 
are no additional complications. 

We start with the infinitesimal picture in normal coordinates. We choose any point [q, p) E M xM. and let 
{xi,X2) be normal coordinates around q. Thus (xi, X2,p) are coordinates on a neighborhood of {q} x M. We 
assume that the controlled process is currently at {q,p), say at time t. The {xi , 2:2 )-marginal of the controlled 
process will be (infinitesimally) Brownian motion on i\/ (with fixed reference metric h), time-changed by 
2e~^P. The control consists of choosing a lift of the tangent plane to M at q into the tangent space to E at 
{q,p)- The controlled process has its martingale part diffusing (infinitesimally) along this lifted plane in the 
unique way that gives the right (xi, X2)-marginal, and has its drift along the fiber at rate e^^PRh (plus an 
additional —2Trx(M) / area(M, h) for the normalized Ricci flow). More precisely, the control consists of a 
choice of (ai, 02) G M-^, for which the processes evolves (infinitesimally, assuming the process is at {q,p) at 
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time r) according to 







' e-P 






e-P 






e-Pai e-Pa2 



\/2dW^' 
V2dWl 



e-^PKhiq) 



where W-^ and are one-dimensional Brownian motions. Here we have written Kh{q) to emphasize 
that the curvature depends on the point in M. The factors (in front of the Brownian differentials) are 
needed because the Ricci flow is defined using the Laplacian, instead of half the Laplacian, and rather 
than use a non-standard normalization for the Ricci flow, we choose to speed up our Brownian motions 
(this is analogous to the usual discrepancy between the analysts' and the probabilists' versions of the heat 
equation). This is the controlled process, at least rnfinitesimally, corresponding to the Ricci flow. For the 
normalized Ricci flow, the set of controls is the same, but the process evolves according to 



dxi^r 




' e-P 


dX2,T 




e-P 






e-Pai e-Pa2 



V2dW^ 
V2dWl 



e-^PKh{q)-r_ 



We point out that, for both the Ricci flow and the normalized Ricci flow, the (infinitesimal) diffusion 
matrix is 







2e-^P-ai 



2e-2p-a2 
2e-2p-(af +ai)_ 

in {xi,X2,p) coordinates at {q,p), of course. 

Having given the infinitesimal picture, we now extend this to a global description. While it is tempting 
to simply assert that this follows immediately from the local description, we prefer to give a more explicit 
formulation. There is more than one way to do this, but we choose to use the bundle of orthonormal frames 
on (M, h). The immediate difficulty with extending the above local picture is that, except in special cases 
(more on which below), we cannot find coordinates which are normal at more than one point at a time, or 
even a global orthonormal frame. The solution we have in mind is to use the bundle of orthonormal frames 
to supply each point along the evolving process with an orthonormal frame and its associated normal 
coordinates. In particular, let 0{M) be the bundle of orthonormal frames over {M, h), consisting of points 
{q, c{q)) where q G M and c{q) is an orthonormal basis for TqM with metric h. We identify z{q) with the 
corresponding linear isometry from to TqAI. Let ei and 62 be the standard basis for and let €(ei) 
be the corresponding canonical vector fields. Further, we let tt : 0{M) — > M be the usual projection and 
TT* : TO{M) — > TM be the induced push-forward map on tangent spaces. 

The connection with the previous infinitesimal picture comes from the following relationship between 
the canonical vector fields and normal coordinates. Choose a point q G M and a frame z{q) over q, and let 
{xi,X2) be normal coordinates (for {M, h)) in a neighborhood of q such that — z{q){ei) at q. Obviously, 
TT, [^(ei)|(g,c(g))] — dxilq- Moreover, let s be a smooth section of 0(M) in a neighborhood of q which is equal 
to e(g) at q and horizontal at q, meaning that dx^s are horizontal vectors at q. Then tt* [€{ei) o s] agrees with 
to first-order around q. (Indeed, to show that such a section s exists, start with normal coordinates and 
apply the Gram-Schmidt process to {dx^ , 8x2} at every point in a neighborhood of q.) 

We also recall the connection between the bundle of orthonormal frames and Brownian motion on (A/, h) . 
Wehavethat (2;(ei)^ + £(62)^) /2is Bochner's Laplacian on ©(A/), and the corresponding martingale prob- 
lem is well-posed (in the sense of Stroock and Varadhan, namely that there is a unique solution for any ini- 
tial point). We use to denote such a process. Projecting 13^ to M gives Brownian motion on M, which we 
denote Br ■ This is the well-known Eells-Elworthy-Malliavrn construction of Brownian motion on M and 
we refer the reader to [15] or[24] for a detailed account on the subject. Moreover, the process Br on 0{M) 
should be thought of as the horizontal lift of Br on M, and thus as giving Brownian motion equipped with 
parallel transport. In particular, this is how we will typically understand Br, as Brownian motion on M 
endowed with parallel transport. Finally, we note that the solution to the martingale problem for Bochner's 
Laplacian can be realized as the (unique) strong solution to the natural SDE driven by a standard Brownian 
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motion on R^, or equivalently, two independent, one-dimensional Brownian motions. That is, B^- can be 
realized as the solution to 

dBr = (£(ei) o dW^ + €(62) o dW^ 

where odW indicates that the differential is to be understood in the Stratonovich sense. 

We now have the necessary background to give the global formulation of the stochastic target prob- 
lem for Ricci flow (and the related target problem for normalized Ricci flow). We write points in E as 
{x,p) e A/ X M and the controlled process (for the Ricci flow) as = [xttPt)- As suggested above, the 
Af-marginal Xt will be Brownian motion on M, time-changed by p, and thus we know from the above that 
we have parallel transport of frames (for T^M) along the paths xt (note that the frame is always orthonor- 
mal relative to the metric h). In particular, if we choose a frame e(a;o) at the starting point, then we let ({xt) 
denote the parallel transport of this frame along x^- Abstractly, the control consists in choosing a lift of 
Tx^M to T(x^,p^)E. In terms of our evolving frame, such lifts can be identified with points of M?. This the 
time to formally introduce the control process. In what follows, (17, P) is a probability space where the 
Brownian motion {W^ , W'^) is defined and the reference filtration involved here is Fr, the one generated by 
the Brownian motion. 



Definition 1. For a fixed time t > 0, an admissible control process A is a bounded map A: [0, x Af x O — > I 
which is continuous in the first two coordinates, and such that for each {x,t) £ M x [0,t\, A{t,x) : — >• 
FT-measurable. We write this in components A = (ai, 02). 



IS 



We will explain below in the first remark of this section why we require the control to be bounded. 
If we start our process from a point Fq = (2^07^0) equipped with a frame e(a;o) of T^^M, then it evolves 
according to the SDE (note that we're using both Ito and Stratonovich differentials) 



(10) 



dxr = e 



dpr 



.i=l 
2 



J2a^V2dW^ 



+ e-^P- Kh{xr) dr. 



Here we see that z{xT){ei) is just the projection onto M of €{ei) and to ease the notation we will also use the 
shortcut c{xr){ei) — ei{xr), or even more simply e^, if there is no confusion generated by dropping Xr- In 
particular, the horizontal lift of Xr, which we write Xr ~ {x^-, t{xr)) evolves according to 

" 2 

dir = e^P- <&{ei)V2 o dWl on 0{M), 

.2 = 1 

and the first line of (10) is just the projection of this onto M. We choose to write (10) in this fashion in order 
to emphasize that we're ultimately only interested in the evolution of the surface in E and not in the frame; 
the frame is only used as a convenience in order to express the control and the corresponding SDE. We do 
this despite that fact that (10) requires evolving the frame z{xr) as well. 

The mixing of Ito and Stratonovich differentials in (10) is a result of the fact that horizontal Brownian 
motion (or just Brownian motion on A/) is not easily written globally in Ito form. To clarify this, we give 
the following equivalent characterization, which is just a consequence of Ito's formula. For any smooth 
fimction : [0, T] x Af x M M (assuming that the process (x^jPt) exists for r G [0, T]), 



dip{T,Xr,Pr) =e P^Y{c.,{xr)ip + anp')V2dW^ 



(11) 



i=l 

+ (dr^ + e-^P- Kh{xr)(p' + e'^P-Ah^ + e'^f- ^ a^^cp" + 2e-^P- ^ a,{c,^)^'\ dr 

\ i=l 1=1 / 



where all the "inside" functions are evaluated at (r, Xr^Pr), ii{x)(p signifies the derivative (along ei{x)) with 
respect to the second variable of (p, dr^p is the derivative with respect to r variable, and the prime is the 
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partial derivative with respect to p. Note that if we let (x 1,2:2) be appropriate normal coordinates at a 
point, then applying this to xi, X2, and p shows that, at that point, this agrees with the infinitesimal picture 
described above. 

We now take a moment to discuss what we mean by asserting the the controlled process arises from the 
control via the SDEs just mentioned. We understand these (systems of) SDEs in the weak sense, that is the 
choice of driving Brownian motions {W^ , W^) is part of the solution, not prescribed in advance. Of course, 
for an arbitrary choice of controls, a solution need not exist, and if it does, it may not be unique in law. We 
will have more to say about this later, after we introduce the target problem. 

Now that we've specified the admissible controls and described the evolution of controlled process 
Yr {A) that a choice of control gives rise to, it's time to explain how this gives rise to a subset of E. 



Definition 2. We define the reachable set at a given time t e [0, 00), denoted V{t), to he the set of points in E for 
which there exists an admissible control such that the controlled process, started at this point and with this control, is 
in r(0) at time t almost surely. 



We follow Soner and Touzi [ ] in calling this the reachable set, even though it's the set of points you can 
reach a fixed target from, not the set of points you can reach from a fixed starting point. In order for this to 
be well-defined, we need to show that V{t) doesn't depend on the initial choice of frame. Suppose Ar is a 
control such that Yr{A), started from y £ E with initial frame e(j/), hits r(0) at time t almost surely (so that 
y e V{t)). If e(y) is any other (orthonormal) frame at y, then there is some r E 0{2) such that z{y) — rz{y). 
It's clear that A-^r is such that Yr{Ar), started from y G E with initial frame e(j/), hits r(0) at time t almost 
surely. Thus a point of E is in the reachable set or not independent of what frame we use to express the 
controlled process, and so the V{t) are well-defined. 

For a point in the reachable set, we will indicate the control in the definition by A, if necessary indicating 
the point in V{t) by writing A{xo,Po) or A{Yq), and call it a successful control (this seems linguistically 
more appropriate than optimal control). In light of the fact that this depends on the initial choice of frame, 
a successful control should really be thought of as a family of controls indexed by 0(2). However, since the 
dependence on the initial frame is so simple and not our primary focus, we will generally gloss over this. 
We will also write Yr{A) as Yt- Thus, the defining property of a point in V{t) and the associated successful 
control is that if we start the process at this point in V{t), then Yt{A) G r(0) almost surely. This necessarily 
requires that, for a successful control A, there exists a solution to Equation (10) and thus a corresponding 
process Yt-{A) for all time t E [O.t]. In particular, one might imagine that some choice of control gives rise 
to a solution under which p^ blows up prior to t {xr can't blow up since M is compact), but such a control 
cannot be a successful control by definition. The definition does not require that a successful control gives 
rise to a solution Y^. (A) which is unique in law, despite the fact that our notation makes it look as though 
Yr is always determined by A. (So it is conceivable that a successful control might give rise to another 
solution Yr that doesn't almost surely hit the target.) Nonetheless, we will see below that, as long as a 
smooth solution to the Ricci flow exists, there is essentially only one choice of successful control starting 
from a given point of V{t), that it is well-behaved, and that this control uniquely determines Yr- 

Finally, we recall that the stochastic target problem is the determination of the reachable sets V{t). We 
note that V^(0) — r(0); understanding V{t) for positive t and its relationship to Ricci flow is the topic of the 
next section. Looking ahead, what we will prove is that, assuming the Ricci flow has a smooth solution for 
some interval of time, that solution agrees with the solution to the stochastic target problem in the sense 
that V{t) = r{t) at all times in this interval. 

Naturally, we have an analogous set-up which we associate with the normalized Ricci flow. The set of 
admissible controls remains the same, but now the controlled process, which we denote Y^ (A) (the "n" in 
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the superscript standing for "normalized") evolves according to 

" 2 



dxr = e-P^ ^{Xr){e^)V2 O dW^ 



dp^ = e-P- a,V2dWl + {e-^P-Kh{xr) ~ r) dr. 



,i=l 



Note that the only difference from Yj- is that the drift of pr has an extra term. 

We denote the corresponding reachable sets by We also have the analog of Equation (11) where 

e^^P^Kh there is replaced by e^^P^Kh, — r: 



Remark. We want to discuss why we insist that our control (ai , a2 ) is in L°° . We begin by describing a simpler situation 
wldch illustrates the essential point. Suppose we consider a real-value controlled process given by 



where at is an adapted real-valued function which serves as the control. If we consider the goal to be to make the 
process xt hit in within time 1 (and we stop the process when it hits 0), then we would like to assert that this is 
impossible, because, for instance, it would violate the martingale property of xt . However, without some additional 
restriction on at, this will not be the case. For example, consider the following scheme for controlling the process. For 
t £ [0, 1/2), we let a be the constant such that the process has probability 1/2 of hitting by time t = 1/2. It is clear 
that this is possible, since letting a be constant means that xt is simply a time-changed Brownian motion, and we know 
that Brownian motion almost surely hits the origin in finite time, no matter where it is started from. Then ait — 1 /2, 
the process has hit and been stopped with probability 1/2. If it hasn't, then 2:1/2 is some positive value. Again, we 
can find some constant value for a, depending only on 2:1/2, such that if we let at equal that constant for t G [1/2, 3/4), 
then the process hits in that interval of time with probability 1/2. Thus, by time f = 3/4, the process has hit with 
probability 3/4. Now we can iterate this procedure, at each step using up half of the remaining time, in order to get xt 
to hit with probability 1 by time t = 1. If we do this, the resulting process Xt will no longer be a martingale on the 
interval t G [0, 1] but instead merely a local martingale. Part of the point is that this is a simple trick. We can think of 
at as determining a time-change so that xt is a time-changed Brownian motion, and since we know Brownian motion 
hits the origin in finite time, if we're allowed to speed up time as much as we'd like we can simply compress the entire 
lifetime of the Brownian motion prior to the first time it hits the origin into a finite interval. 

We now return to the target problem we associate to Ricci flow. In light of the above, if we assumed only that (ai , 02) 
was adapted, we could imagine a similar procedure of choosing the control to be very large so that, from any starting 
point, we could cause it to hit (this is a moving target, but it varies in a smooth fashion and stays bounded) by time 
t. Once it hits Vt-ri ^6 could then "switch" to the successful control described in the next section in order to hit Pq as 
time t. The result would be that every point would be in V{t), which is obviously not what we want. Of course, what 
we've just described uses a discontinuous control, but one can imaging smoothing it to get a continuous analogue. At 
any rate, the imderlying logic of this "bad" control justifies our wish to avoid unboimded controls. 

Requiring that (ai, a2) be bounded prevents this kind of easy trick and forces a successful control to respect the 
geometry of the situation. Of course, one might imagine that there might be other, less restrictive, ways to achieve this, 
such as requiring the controls to be in some L'' -space for finite p or requiring some natural coordinate to be a martingale, 
as opposed to merely a local martingale. Indeed, if one were to extend this stochastic target formulation to include, say, 
non-compact surfaces, it seems like some weaker assumption on the control would be appropriate. However, for the 
present paper, we have no need to speculate on what other conditions one might want in other circumstances. 

Remark. We close this section by noting that, in the case when {M,h) is flat (and thus either a torus or a Klein bottle), the 
orthonormal frame bimdle is unnecessary. In particular, uniformization implies that (M, h) is isometric to 'M? modulo 
the action of the group of Deck transformations A. If we let 2:1 and X2 be the usual Euclidean coordinates on R^, then 
h = dx\ + dx% (after identifying M with R^/A). Further, {Wl , W^) is Brownian motion on {M, h), once we take it 



(13) 



2 



dip{T,Xr,Pr) =e P^ Y{ei{xr)ip + ai<f')V2dW^ 




dxt = at dWt, xo = 1, 
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modulo A. In this case, the set of controls are adapted, time-continuous, bounded maps into {(ai, 02) : G R}, and 
the controlled process simplifies, so that it is given, for both Ricci and normalized Ricci flow, by the SDE 



dxi,T 




' e""- 


dX2,T 




e-"^ 









V2dW^' 
V^dWl 



Convention. Throughout this paper very often we will have a fixed time t > so that the stochastic target problem 
is defined on [0, t] or the (normalized) Ricci flow is defined up to time t. Since the process time is always going to be 
in [0, t], all the stopping times involved will always be minimized with t so that the stopped process is well defined. 

Also, the constants involved in the main estimates may change from line to line in such a way that they do not 
depend on time t. 



At this point, we've described a pair of closely related stochastic target problems, namely the deter- 
mination of V{t) and V^"(t), which we associate with Ricci flow and normalized Ricci flow, respectively. 
However, we've given no justification for these associations. In the present section, we prove that, under 
the assumption that a solution to the Ricci flow exists, the solution is given by the reachable sets. This 
justifies the introduction of these particular stochastic target problems in the context of Ricci flow. 

Continuing with the notation of the previous section, we suppose that there is a smooth solution p^ to 
the Ricci flow, that is, to Equation (8), with initial condition on the interval t £ [0, T) (where we allow 
the possibility that T = 00). At each time t, we can associate the solution with a section of E over M and 
thus with a submanifold of the total space E, which is smooth and intersects each fiber once, transversely. 
We call the resulting submanifolds T{t) and note that this extends our earlier definition of r(0). Of course, 
knowing the T{t) for t G [0, T) is equivalent to knowing p^. Similarly, suppose there is a smooth solution p" 
to the normalized Ricci flow, that is, to Equation (9), with initial condition Pq — Pq on the interval t G [0, T") 
(where, for the same manifold {M, h) with the same initial metric go, it is not necessarily true that T and 
T" are equal). Then we have the associated submanifolds r"(t) of E. The connection between the Ricci 
flow and normalized Ricci flow (viewed in this way) and the stochastic target problems introduced above 
is given by the following theorem. Note that both this sort of result and the method of proof mirror that of 
[^^]. The main additional complication, besides the geometric formalism needed for the general statement 
of the target problem, is that the controls are not restricted to a compact set. 

Theorem 3. Let (A/, h) be a smooth, compact Riemannian surface with initial metric go = e'^P°h, as above. Suppose 
that the Ricci flow has a smooth solution p^ on t £ [0, T). Then r(<) = V{t) for all t G [0, T). Similarly, if the 
normalized Ricci flow has a smooth solution p^ out <E [0, T"), then r"(t) = V"{t)for all t G [0, T"). 

Proof. We start with the Ricci flow. We fix some t G (0, T) and let r be the time parameter for the controlled 
process Yt-{A), t g [0, t] (as usual in probabilistic approaches to PDEs, process time runs "backward" com- 
pared to PDE time). We consider the square of the vertical distance between the controlled process IV and 

r(t — t). Thatis, we consider 77(2;, p, r) = (p — (a;)) ^ along the paths of IV/ so that jyr = [pt —Pt-T{^r)Y- 
Actually, we begin by considering a slightly more general quantity. Let £,{x,p, t) = p — Pf._^{x), and for 
the moment let (,c : M [0, 00) be any smooth function. We wish to consider (p{S^{x,p, r)); clearly ry is just 
the special case (p{z) = z^. 

We now apply Ito's formula (11) to {ip{£,))r. In the following, p is always evaluated at time t — t and 
position Xr, we write e, for e{xr){ei), and we suppress other arguments (such as for the controls a;) as 
desired to make things more readable. Then we have 



3. Verification and the connection with Ricci flow 



2 



(14) 




+ e~'^P^ [f'Kh + ip" {aj + af)] dr + 2e '^P^tp" [-aiZip - a2C2P] dr. 
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Recall that + is just A/j. Then a little algebra and the fact that p satisfies Equation (8) allows us to 
simplify this, yielding 

(15) d{^{^))r = %/2(^'e-P^ [(ai - tip) dW^ + (as - t2P) dW^] 

+ [e-^P^if" [{a, - t,pf + (a2 - t^pf] + ^' (e^^P _ g-2p.^ (^^^^ _ Kh)]dT. 
We now return to considering 77. In this case, this equation specializes to 

(16) dijr = 2V2 {pr - p) e-P- [(ai - eip) dM^^i + (03 - t2P) dW^] 

+ 2e-^P- [(ai - tipf + (a2 - t2pf] dr + 2 {pr ~ p) (g-^P _ g-^P-) (A,,p - A',,) dr. 

First we show that any point {x,p^{x)) in r(f) is in V{t). Obviously, this is true for t — {). Now choose 
t > 0. We choose our controls ai and 02 as follows: For r e [0,t], we let ai be ei]3t_^(xT-) and 02 be 
e2Pt-Ti^r)- Thus, our controls are Markov with respect to the process' position and the time (and the 
"current" frame, although this is largely just a convention, as discussed above). Intuitively, all we are doing 
is trying to cause the process to be tangent to the evolving solution given by p. Our controls are not only 
Markov in space and time, but they are given by evaluating smooth functions of space and time (and the 
lift of "space" into the orthonormal frame bundle) along the controlled process, and thus we know that 
the system of SDEs for has a unique strong solution. In particular, is uniquely determined by these 
controls. Using these controls. Equation (16) simplifies to 

diir ^2{pr- p) (e^'P - e-'^P^) {AhP - K^) dr. 

Because p is smooth on M x [0, T) and M is compact, we know that both (x) and AhPf_^ {x) — are 
bounded on {x, r) e M x [0, t]. Now choose any (5 > and let 9s — inffr : rjr > S} be the first hitting time 
of 5. Also observe that both the controlled process Yr = {xr^Pr) and rjr have continuous paths. If we stop 
our process at Og, then pr is also bounded (this follows from the fact that p is bounded and the definition of 
rj). Combining the boundedness of both p and pr with an easy estimate for the exponential function, we see 
that e"^^ — e^^P^ is bounded above and below by a constant multiple of ± [pr — p), respectively. It follows 
that (for T < 6s), we have drjr < Crjr dr, for some positive constant C depending on t, 6, and the bounds 
mentioned above. Recalling that 770 = 0, because we start our controlled process on r(t), and integrating 
gives 



/•T/\es 

ilT/\es < C* / Vsds for T e [0, t]. 
Jo 



Then Gronwall's lemma implies that 77^^51^ = for all re [0,t\. Because 7]^ has continuous paths, this means 
that 9s > t, and thus we have that rjr = for all t e [0, t]. In particular, j]t — 0, and so Yt G r(0). Thus we've 
shown that r(<) c V{t). 

Next, we need to show the opposite inclusion, V{t) C r(<). Again, this is clear for t = 0, so we fix some 
t e (0, T). We have some starting point {a, f3) E M x M, and we assume that there exists a control (ai, 02) 
such that lV(ai7 02) almost surely hits r(0) at time t — t. 

At this point, we produce a mollified version of 77 by a judicious choice of (p. In particular, we now let 

: M — > [0, 00) be a smooth, symmetric function satisfying the following additional properties: ip is non- 
decreasing on [0, 00), Lp{z) = in some neighborhood of 0, and p is constant on \A, 00) for an appropriately 
chosen constant A. It follows that the value of Lp on \A, 00) is positive, (/? is only at 0, and all derivatives 
of p are bounded. If we now let i'i{x,p, t) = (p{^{x, p, r)), then 77 is a mollified version of 77, in the sense that 
they agree for small values of 77 but 77 is bounded, along with all of its derivatives. 

Let D{t) — E [fir]- Then Equation (15) shows that 

(17) D{t) = D{0) + rE\e-^P-^"\{ai-eipf + {a2-Z2pf]+p'{e-^^-e-^P-){Ahp-K, 



ds. 

Here, of course, the derivatives of p are evaluated at ^(iViPr, ''■)■ Note that cip, Z2P and Ahp — Kh are all 
bounded. Also, for small ^ we have that p" = 2 and p' = 2 [pr — p), and both of these derivatives are 
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bounded for all ^. Moreover, both e'^'^'P^tp" and (/?' (e"^'' — e^^^^) are bounded because the derivatives of 
if are identically zero for ^ > A. In addition, for any two constants Ci , C2 > 0, there is another constant 
C3 > such that for any ^ G M, 

Combining all of these with the fact that ai and 02 are bounded, we see that there is some positive 
constant C (different from the one above) depending on the bounds just mentioned, but not on r, such that 

D{t) > D{0) -C [ D{s) ds for all t e [0, t]. 
Jo 

This is trivially equivalent to 



-Dir) < -DiO) + i-C) 



I i-Dis)) ds forallr e [0,i]. 



Applying Gronwall's inequality for the function —D{t) on this interval then yields 

-D{t) < (-D(O)) e-^^ for all r e [0, t]. 

In particular, letting t ^ t and multiplying by —1 gives £'(0)e~*-^* < D{t). By assumption, the controlled 
process hits r(0) at time t a.s., and thus D{t) = 0. Since D is always non-negative (because (p was chosen 
to be non-negative) and e~^* is positive, we conclude that £'(0) = 0. This is equivalent to saying that our 
initial point {a, (3) is in T{t). Thus we have proven that V{t) C r(t). 

The proof for the normalized Ricci flow is almost identical. With the appropriate quantities, p,Pt, Xr and 
so on, equation (14) becomes 

d{^{£,)\ -N/2(^'e-P- [(ai ~ tw) dW^ + (02 - Z2P) dW^] 
2 

(18) + e""''^ [v" {-upf + ^' (-e^p)] dT + ^'dtpdT 

+ e-2P- [if'Kh - re^P- + tp" (a? + al)] dr + 26^2?-^" [-aitip - a2C2p] dr. 

and then from (9), we get exactly the same equation from (15), thus the rest of the proof is identical. □ 

From the point of view of control theory, the above result is a verification theorem. From the point of 
view of PDF theory, this can also be thought of as a uniqueness theorem. In particular, it shows that smooth 
solutions to the Ricci flow are unique and we state this in the following. 

Corollary 4. If there is a (smooth) solution to (normalized) Ricci flow on the time interval [0, T), then it is unique. 

It bears repeating that the above relies on already knowing that the Ricci flow has a smooth solution on 
some interval; in other words, it sheds no light on the existence of a solution (to either the Ricci flow or the 
control problem). On the other hand, this existence is well known in the present case. Cao [ ] and Hamilton 
[14] show that, for a smooth, compact initial surface, the Ricci flow always has a smooth solution on some 
(non-trivial) interval of time, and the normalized Ricci flow has a smooth solution for all time. (Of course, 
much more can be said, including the relationship between the normalized and un-normalized flows, but 
again, this is well-known and can be found in any book on the subject.) For an accessible overview we refer 
to [/ , Chapter 5], which treats the (normalized) Ricci flow on surfaces. 

One additional feature of the successfully controlled process is that it provides Brownian motion on M 
under the backward Ricci flow (or backward normalized Ricci flow, of course), as we now explain. If we 
put a smooth family of metrics gr on a smooth manifold M, then a process S,- is a Brownian motion on 
(M, Qr) if it solves the martingale problem for the time-inhomogeneous operator Ag^. Suppose we have a 
smooth solution to the Ricci flow, as above, for i e [0,7"), and let gt be the metric on M corresponding to 
this solution. Then if we choose a time t (in (0, T)) and point xq € M, there is a unique point (xojPo) over 
xq (where, of course, we use our standard fiber coordinate p) in r(i) = V{t). If we now run our successfully 
controlled process Yr ~ (xttPt) starting from this point, we know that it is on T{t — r) for all t e [0, t), 
or equivalently that pr = p^_^{xr), for all r €E [0,^] almost surely. Then looking at Xr (which is just the 
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M -marginal) and recalling that gt — e^P* h, a little thought shows that x^- is a Brownian motion on (M, gt-r) 
for r e [0,t]. That "process time" runs backward compared to "PDE" time, which manifests itself in the 
t — T parameter (with t fixed and r increasing) for the metric g, explains why we get Brownian motion on 
M under backward Ricci flow, as opposed to just Ricci flow. 

For clarity, let us temporarily denote Xr under the successful control as Xr ■ Then recognizing it as Brown- 
ian motion under backward Ricci flow gives a way of representing the solution to the Ricci flow (or normal- 
ized Ricci flow) that looks more like the usual representations for parabolic (linear) PDEs. In the special case 
when h is flat, normalized and un-normalized Ricci flow are the same, and we see that pr is a martingale. 
Further, we have that 

(19) p,(xo)=E--'[Po(it)] 

where the expectation is taken with respect to the successfully controlled process started from (xo,|?t(xo)) 
and run until t = t. This is analogous to solving the heat equation with some initial condition by running 
Brownian motion and then using it to average the initial condition. The difference is that, for the heat 
equation, we can construct Brownian motion (or more analytically, the heat kernel) without already having 
a solution to the heat equation with our initial data. This is because Brownian motion (or the heat kernel) 
doesn't depend on the initial data, and so we can use it to solve the heat equation in the first place. All of 
this is a manifestation of the linearity of the heat equation. In the case of Ricci flow, we need to know pr in 
order to determine Xr (or more accurately, these two are intertwined by the system of SDEs they solve), so 
we can't first determine Xr and then use it in the above to solve the Ricci flow. 

Also, we can now say a bit more about the recent work of [ ] and [ ]. They give a lift of Brownian motion 
on a manifold with time-dependent metric to the frame bundle which gives the parallel transport along 
the Brownian paths. They then introduce a notion of damped parallel transport which, under the Ricci 
flow (but not the normalized flow), becomes an isometry as well. This damped parallel transport can be 
used to produce martingales from solutions to heat problems under the Ricci flow. In our notation, Xr is 
the Brownian motion with respect to a time-dependent metric (with an additional factor of to get the 
normalization right, of course), and {e^^^ e{xr){ei), e~^^e(a;r)(e2)} (which is an orthonormal frame for the 
time-varying metric) gives the parallel transport along the Brownian path Xr- 



4. The blow ups of the Ricci flow for the case of positive or negative Euler characteristic 



This section is dedicated to showing that in the case of the (unnormalized) Ricci flow, there are blow ups 
either in finite or infinite time if the Euler charaterisitc, and hence the reference metric Kh, is either positive 
or negative. 

Assume now that the Ricci flow has a smooth solution defined on the time interval [0, T). Then, from 
Theorem 3, we learn that for any fixed time t e [0, T), p^ = Pf_^{xr) where (a;^,p^) is the solution to (10) 
with the initial conditions {x^pq{x)). On the other hand, taking a smooth function (^9 : [0, i] x M M in (11), 
we obtain that 



i=l 



dMT,Pr) + e-^P-{ip'{T,Pr)Kh{Xr) + ip"{T,Pr) 

i=l 

Since the successful control is given by = tip^_^, we get 

2 

Y,a}^\Vpt-r{Xr)\', 

1=1 

and this means that 

v{T,Pr) - / [dM'y.Pa) + e-'^P' {ip'{cT,p^)Kh{x^) + ifi" (a, p^)\V Pt-a{xa)\^)] da 



dr. 
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is a martingale. In particular, taking expectation at times t — and t — t and using — pt-rixr), yields 

(20) /•* , , 

- / [dM<J,p,) + e-^P' {^'{a,p,)Kh{x,) + ^"(a,p,)|Vpt„<,(x,)|2)] da. 



There are two obvious obstructions stemming from this formula. The first one is that if Kh {x) > for all 
X E M, then taking tf{T,p) = e'^P, the above formula (20) implies 

^2p,{x) ^^ix,t)^^2po(x,)^_2 f E'^^-t) [Kh{x^) + 2\\'pt-aixa)\^]da <E^'=''\e^P°^'''^]~2 f E^^'*^[Kh{x„)]da 

Jo Jo 
and thus, upon denoting the uniform norm by | • |„ and taking Kq = inix^M Kh{x), 

As this is true for any t € [0, T), the extinction time of the Ricci flow is finite and is certainly at most 
e2|po|u^^2i4ro). Therefore, in the case of positive curvature the flow develops singularities in finite time. 

On the other hand if the curvature is negative {K/^ < on M) then there are some constants Ci , C2 > 
such that 

Pt{x) > log(Cii + 1) - C2 for all xE M and t > 0. 

To see this, take Kq = inixeM ^Kh{x) > 0, thus Kh{x) < —Ko < and then consider ip{T,p) = pin (20) to 
deduce that 

Pt{x) = [po(a:t)] - / E(^'*)[e-2P-i^,(a;,)]da > inf po 
Jo 

which means that pt {x) is bounded below uniformly in f > and x e M. Now consider the test function 

ip{T,p) — exp (^a{t — T — jk^^"^'^))- Since pt{x) is bounded below, this implies that for large enough a, 

^"{^^Vry) ^ 0. On the other hand, dr(p(a,p) — Koe^'^Pip' {a,p) = 0, and this combined with the preceding 
and the fact that (p' is negative leads to 

ifiO, Ptix)) <E'--[ipit,po{xt))] < 1, 

which means that pt{x) > ^ \og{2KQt) for any t > for which pt exists. In particular this shows that either 
the flow ceases to exist after a finite time, or, if it does exist for all times, pt (x) goes to infinity uniformly 
over X E M. The moral is that we can not expect the Ricci flow to converge as the time approaches either 
the extinction time or infinity 

For the flat case, since the curvature is 0, the normalized and the unnormalized Ricci flows are the same 
and thus we will treat this case as the normalized Ricci flow. 

Remark. The blow up in the negative case does not take place in finite but this requires more arguments. However we 
see this 



5. Time-dependent a priori bounds for Ricci flow 

We now turn our attention to using the stochastic target representation for the normalized Ricci flow to 
derive (more accurately, of course, to re-derive) geometric facts about the flow. We will always work with 
the case where the reference metric h has constant curvature. By uniformization, this is no loss of generality, 
and it simplifies the analysis considerably. After a preliminary rescaling, we can assume that this constant 
curvature is either 1, 0, or —1. Further, we can rescale the initial metric go so that it has the same area as h. 
Thus, without loss of generality, we are in one of three cases (by the Gauss-Bonnet theorem). First, if the 
Euler characteristic of M is positive, we have that Kh = r = 1. If the Euler characteristic of M is zero, we 
have that = r = 0. Finally, when the Euler characteristic of M is negative we have that = r = — 1. 
The bounds we have in mind are similar in all three cases, although the differences in sign of Kh result in 
important differences. 

We call these bounds "a priori" because they don't depend on the structure of the reachable set. We 
elaborate on this after Theorem 6. 
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We have one more comment about notation before we begin. Because we will be concerned with the 
normalized Ricci flow for the rest of the paper, we drop the "n" superscripts. Thus, for instance, we let pt 
denote a solution to the normalized Ricci flow, unless otherwise indicated. 

The uiterestuig feature of choosing /i to be a metric of constant curvature is that the drift of the SDE 
satisfied by pr doesn't depend on Xr (although the target always does, except in trivial cases). In particular, 
we have the following three cases: 

" 2 



1 



(21) 



dpr = 

r = : dpr = 

r = — 1 : dpr = 



2 



-2Pt 



1) dT 



J2a,V2dW^ 



.1=1 

2 



(l-e-^P^) dT 



In general, the stochastic target problem for the normalized Ricci flow (and also the Ricci flow itself) 
gives an equation of the form 

" 2 



(22) 



dpr 



.i=l 



UripT)dT, 



where the controls at, i = 1,2 are bounded and chosen such that pt is almost surely on Ma, the section 
corresponding to po in the bundle M x M. In the case at hand we assume that Ur{p) is a function U : 
[0, t] X M — > M which is uniformly locally Lipschitz in the second variable, i.e. for any L > there is a 
constant Cl with |C/r(p) - Uriq)\ <Cl\p~ q\ for all r G [0, t] and p,q e [-L, L]. 

The basic point is that there are natural barriers for pr given in terms of equation (22) where the martin- 
gale part is set to be equal to 0. To be precise, we define a barrier as a solution q^. to the ODE 

(23) dqr = Ur{qT)dT. 
In this framework we have a general result as follows. 

Lemma 5. Assume that p^ and q^ are solutions to (22) and (23) respectively for re [0, i] with U a uniformly locally 
Lipschitz function in the second variable on [0, t] x K. 

If at any time ti g [0, t), p^^ < qr^ with positive probability, then at any later time t2 e {Ti,t], pr^ < Qra ^ith 
positive probability. 

Similarly, if at any time ti e [0,t), pr^ > qr^ with positive probability, then at any later time & iTi,t], 
Pt2 > qT2 positive probability. 

Proof. The proof is a basic application of stopping time and Gronwall's lemma. We will prove only the first 
part, the second one being similar. 

So, assume that gn > Pn with positive probability, and therefore that we can choose a constant L > 
such that L> qr^ — p^ > l/L with positive probability. We further take L large enough so that \qr\ < L for 
allr e [0,t]. 

Now, for any smooth function : M — > M, we have 

(24) fj{q,-p,) ^ r^{q,^-p,^) + Mr+ f {e'^P^ rj" {qs - Ps)iai{s) + al{s)) + rj' {qs - Ps){Usiqs) ~ UsiPs))) ds 



where Mr is a martingale with M{ti) — 0. Further, we choose the function such that it is non- 
decreasing, equal to for f < 0, equal to 1 for £,>2L and 77 (^) = for small ^ > 0. 

Next, we define the stopping time a = mi{u > ti : p„ > g„} A t. With this setup, we will denote for 
simplicity rjr ~ ri{qr — Pr), = ^'ilr — Pt) and 77" — T]"{qr — Pr)- Furthermore, from (24), 



E[l[,^,,] (s) {e-^P'ri'^{ai{s) + al{s)) + 7^',{U,{qs) - Us{Ps)))]ds. 
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Since Qs remains bounded on [ti,T2] and rf has compact support, combined with the property that 11^ is 
uniformly Lipschitz in the second variable on compact intervals, we can find a constant C > 0, such that 

This, the choice of our function 77, the fact that the controls Ui, i — 1,2 are bounded, and that qs is bounded, 
yield, in the first place, that e^^P^rf^ is bounded, and also that for some constant C > 0, 

To check this, one can reason as follows. For qs < ps, both sides are 0. For e > qs — Ps > with small 
e, the first term is non-negative and the second one is bounded below by —C{qs — Ps)^ which is again a 
constant times rjs . For qs — Ps > the inequality follows easily as the left hand side is bounded below by 
some negative constant and rjs is certainly bounded below by . 

Summing up the findings we get that 

Jti 

Since a is the first time Ps = qs, it follows that, l[ri,c7] {s)ris = rjs/\a, consequently, for some C > 0, 

KiVrAa] > E[r?ri] ~C f E[7]sAa]ds for T G [ti, Ta]. 
Jti 

At this point an application of Gronwall's inequality results in 

E^A^le^^"-"^) > E[7^rA > 0, 

where the hypothesis qr^ > Pn with positive probability is translated into the last inequality For r = T2 
we obtain £[7/^-2 act] — ^[77^2, cr > T2] > and therefore we conclude that {a > T2} has positive probability; 
stated otherwise, the probability that qr2 > Pt2 is positive. 

One technical word is in place here. Namely, the definition from (22) is in the sense of local martingales, 
but during the proof we look at ri{qT —pr) and this is actually a true semi-martingale not merely a local one. 
This is indeed due to the boundedness and continuity of the quantities involved, namely e~^"r(s, e~^''=?7" 
and the controls ai,i = l,2. □ 

Next, we solve equation (23) for each of the three cases described in equation (21) (this is straight- 
forward, as the resulting ODEs are separable). For ease of reference, we will label the resulting equations 
as ij) with super- and sub-scripts indicating relevant parameters. In the case r = 1, we have that 

BI{t) = ^ log (1 — ce^^^) for some constant c G (—00, 1). 

The choice of c allows any initial condition. Note that c — gives the constant solution Bq{t) = 0. For any 
c, as T — > 00, we see that B^ (t) 0. The case r ~ gives 

B^ (r) = c for some constant c £ M. 

Obviously, the choice of c allows any initial condition. (This is perhaps a bit pedantic, but we include it for 
the sake of completeness.) Finally, r = — 1 gives 

B~^{t) = ^ log (1 — ce^'^) for some constant c € (—00, 1). 

Again, the choice of c allows any initial condition, and c = gives the constant solution B^^{t) = 0. 
This time, though, if c 7^ 0, then the solution heads to ±00 as t increases (in finite time for negative initial 
condition, and as r — > 00 for positive initial condition). 

Continuing, we want to use the previous lemma and a judicious choice of the parameter c to bound the 
reachable set at time t. Recall that Pq gives the initial metric go and serves as the target in the target problem 
(and which as a section we write as r(0)). The the assumption that 50 arid h have the same area implies 
that maxj;gAf pQ(a;) — a> and that niin2:gA/ Pol^) = ^ 0- Further, if either a or is zero then both are, 
meaning that Pq = and go is just h. 
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The logic of the proof of the following theorem explains why solutions Qt of equation (23) are called 
barriers, in this context. 

Theorem 6. Consider the target problem (for the normalized Ricci flow) where h corresponds to one of the three 
constant curvature cases as discussed above (and with a and (3 as just described). For any t>0,we have that 



and 

inf p 2 

(x,p)ev"{t) 

(If 13 = 0, we set -\ log (l - e^^) = oo.j 

Proof. We start with the upper bound in the r — —\ case. We consider some fixed but arbitrary i > 0. Let 

c' = e-2t (i „ e2"). Then 

B-^{t)^OL and B,-;i(0) = ^log(l-e-2*(l-e2a^)^ 

Thus, by the previous lemma, if we start from a point (a;o,po) withpo > i?jJ7^(0), we have that pt > B'^,^{t) = 
a with positive probability (for any controls). By the definition of a, this means that pt is not in the target 
with positive probability. Since this holds for any controls, it follows that {xq,po) is not in the reachable set 
at time t, which we recall we denote ^"(t). This implies the upper bound on sup^^ p)eV"(t) P given in the 
theorem. 

For the lower bound in the r = — 1 case, consider c' = e~^* (l — e^'^). Then 

B-\t)^(3 and ^-^(O) = ^ log (l - e-^* (l - e^^)) . 

Analogously to the argument for the upper bound, the previous lemma implies that no point {xo,Po) with 
Po < B~,^{0) can be in T^"(<). This implies the desired lower bound. 

For the r — case, analogous arguments apply using c' — a for the upper bound and c' — (3 for the 
lower bound. 

Finally, we consider the r — 1 case. The upper bound is proven just as in the K = —1 case, using 
c' — e^* (l — e^"). The proof of the lower bound is similar, except that if i > — | log (l — e^^), we have 
that Bl{t) > f3 for any choice of c G (— oo, 1). Thus, these arguments do not produce any lower bound 
for inf p)gyre(() p in this case. On the other hand, if t < — | log (l — e^*^), we can let c' = e^* (l — e^^) and 
argue just as before. □ 

In light of the verification theorem, these conclusions can be restated in terms of p^. Namely, we can 
replace sup^^. ^^gyn^^) p in the above theorem with ma.XxeMPt{x) and inf(3,.p)gyn(t) p with miuxeMPtix)- 
Nonetheless, there is a reason to state the theorem as above. Suppose we consider the same target problem 
(or problems, since there are three cases), except that now we allow the target to be any (non-empty) closed 
set r such that maxr p = a > and maxr p — P < 0, rather than just a smooth section corresponding 
a metric go on M. Then we can still ask about the reachable set at time t > 0. Assuming that it is non- 
empty, the bounds in the above theorem still hold (with the same proofs). This shows that these bounds 
don't depend on the verification theorem and the resulting connection with PDEs, or on the structure of 
the reachable set, such as its smoothness or whether it's a section. (Moreover, similar methods could be 
employed even if a and /3 weren't assumed to be non-negative and non-positive, respectively.) It is this 
sense in which we refer to them as "a priori bounds." Of course, it is likely that these bounds are only 
interesting in light of their connection to the Ricci flow, as given by the verification theorem. 

We close this section with some easy observations about this theorem. First of all, if a = 0, then 
supj-^ p)ev'"(t) P ~ for all t > 0, and this holds in all three cases. Similarly, if (3 — Q, then inf p)gv'"(t) P = 
for all t > 0, in all three cases. Since one of a or /3 being zero implies that both are, we conclude that if 
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either a or /? is zero, the reachable set only contains points with p = 0. On the other hand, every point 
with p — will clearly be in the reachable set (just let the controls be identically zero). Thus we will have 
y (f) = {p = 0} for all t > 0. This corresponds to the basic fact that if go is already a metric of constant 
curvature, then it is stationary under the normalized Ricci flow. 

In the case when a and (3 are not zero, we see much different behavior for the cases of the three different 
curvatures. For r = — 1, the bounds improve as t increases, which we will see makes this the easiest case to 
deal with. For r — 0, the bounds are constant. Finally, for r = 1, the bounds get worse as t increases, and 
the lower bound even ceases to exist in finite time. This corresponds to the well-known observation that 
the case of the sphere (or projective space) is the hardest case to handle for Ricci flow on compact surfaces. 
Remark. It's worth point out that the above argument from Theorem 6 is overkill in the r — case, since then the 
lemma follows directly from the fact that pr is a martingale and martingales have constant expectation. 

We finish this discussion with the following useful Corollary which plays an important role later on. 

Corollary 7. For the case ofr = —1, or equivalently, the case xiM) < 0, the solution pt of the normalized Ricci flow 
converges to uniformly in the C°-norm exponentially fast as < — >■ oo. 

The same arguments work in the case of unnormalized Ricci flow. We record this here as follows. 

Theorem 8. For the unnormalized Ricci flow, as long as the stochastic target is well defined up to time t, 



The only thing we should point out here is that there is a blow-up in finite time for the case of r = 1 and 
there is also a blow up in finite or infinite time for the case of r = —1. This recovers the blow-up results in 
the previous section, only this time we used uniformization. 

Remark. This theorem shows that for the unnormalized Ricci flow, in the negative curvature case, the flow does not 
blow up in finite time, at least in the C" topology. This is already a good indication that the solution is defined for all 
times and corroborated with the above Theorem shows that the flow blows up at infinite. Thus, this result is probably 
a better result (in the case of negative constant curvature case) as the one obtained in Section 4. 



For the remainder of the paper, we assume that we have a smooth initial metric and a smooth solution 
to the normalized Ricci flow for all time (which we do since the initial conditions are smooth on a compact 
surface). We are interested in studying the convergence to the constant curvature limit according to the 
stochastic framework we have been developing. 

We consider the cases of zero Euler characteristic and of negative Euler characteristic, and we work 
relative to the underlying metric of constant curvature, as in the previous section. The positive Euler char- 
acteristic case (the sphere or projective plane) is well-known to be more difficult. This is largely due to the 
fact that there are many constant curvature metrics in any given conformal class, so that it's not clear in 
advance which one will be the limiting metric under normalized Ricci flow (this is related to the issue of 
solitons). As a result, we don't pursue this case. 

We are assuming that we have a smooth solution to the normalized Ricci flow for all time. This means 
that the reachable set is always a smooth hypersurface transverse to the vertical fibers. From now on, we're 
only interested in the successfully controlled process, so for notational simplicity we will let (xt, Pr) always 
denote that process (that is, what we previously denoted IV = Yr{A)). Moreover, if pis the smooth solution, 
we see that pr = pt-rixr)- One consequence of this is that we can generally restrict our attention to the 
Xr process. In particular, if we wish to couple two copies of the successfully controlled process (so that 
they meet as quickly as possible), it is enough to couple the marginals, since if the processes meet on the 
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manifold, then they also meet on the fiber. In this sense, what we are doing is equivalent to just considering 
Brownian motion on the underlying time-varying manifold, and so we see again that running a Brownian 
motion along the solution flow (and employing the stochastic techniques that apply in that situation) is 
subsumed by the more general construction of the stochastic target problem. 

A significant part of our results on the long-time convergence of the normalized Ricci flow is based on 
coupling two copies of the marginal process on M, which we denote by x^- and Ur- Recall that x^- will be 
time-changed Brownian motion on (M, h), with the time change given by integrating a = 2e^^^ along the 
paths, and analogously for y.,-, where we let b denote the instantaneous time-dilation (this is one significant 
advantage to working relative to this fixed metric). Note that we've incorporated the \/2 normalization 
factor into the time-change, so that we really do have Brownian motion with respect to h as the underlying 
object. This makes the stochastic analysis look a bit more standard. 

We wish to implement the mirror coupling for Xr and Ur, where the mirror map is with respect to the 
fixed h metric. Viewed in this way, this is a fairly straight-forward variant of the mirror coupling for two 
Brownian motions on a smooth (non-varying) Riemannian manifold. We simply generalize to allow our 
processes to be Brownian motions up to a random but smooth (in terms of the particle's position in space- 
time) time-change. References for the standard (non-time changed) construction are [ ] and [11], and 
we proceed by modifying this as necessary and by not belaboring the aspects which carry over without 
modification. 

Note that, since we're working only in the cases of non-positive Euler characteristic, a (and thus also b) is 
bounded above and below by positive constants (depending only on the initial metric) for all time, by the 
results of the previous section. 

First, let Cm be the subset of M x M consisting of points {x, y) such that y e Cut(a;) (which is equivalent 
to a; S Cut(j/)), and let Dm be the diagonal subset of M x M. Then let Ej^i be M x M minus Cm and Dm- 
Note that the distance function dist(a;, y) is smooth on EMr and that the direction of the (unique) minimal 
geodesic from x io y \s smooth on Em- Let [x, y) e Em', then the mirror map is the isometry from T^M 
to TyM given by reflection along the minimal geodesic connecting x and y- We see that the mirror map is 
smooth (on Em, which is where it is defined). As result, there is no problem in running the mirror coupling 
as long as the joint process is in Em - That is, for one-dimensional independent Brownian motions and 
W"^, consider the system of SDEs 



where ^'r ~ ^(x^., j/^) = mx^.y^z{xT-)e{yT)~^ with rrix.y being the mirror map, namely parallel transport 
followed by reflection with respect to the perpendicular to the geodesic from x to y- Then the coefficients 
are smooth in both space and time, so the system admits a unique strong solution, up until the first time 
the process leaves Em- 

The point of the coupling is to get the particles to meet, so we turn our attention to this issue next. First 
note that the marginals Xr and yr are time-changed Brownian motions as desired, so we're coupling the 
right processes. The natural object of study is the distance between the particles, with respect to the fixed 
metric h. We denote this distance by pr- It is a (continuous, non-negative) semi-martingale, so we derive the 
SDE that it satisfies by Ito's formula. This is the standard computation with the factors of a and b included, 
so we'll be brief. For more on this, see Section 6.5]. 

The martingale part is easily seen to be (a + b) dWr for some Brownian motion WV, whether we're in 
the r = or r = —1 case. (In what follows, we use WV to denote some Brownian motion, which may 
change from appearance to appearance, in order to more conveniently describe the SDE satisfied by a given 
process.) As for the drift, the only contribution comes from the second derivative of the distance with 
respect to the diffusions perpendicular to the geodesic from x to y, which is computed in terms of the index 
of the appropriate Jacobi field along the geodesic from x to y- We now summarize the computation. 




2 
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br Y.^rV^{yr)] O dWl 
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Let 7 be the unique minimal geodesic from x to y (parametrized by arclength), and let u be a unit vector 
field along 7, perpendicular to 7 (this determines v uniquely up to sign, and either of choice of sign is fine). 
Then we want the Jacobi field J{s)v{j{s)) where J : [0, p] M satisfies 

J + rJ ^ 0, J(0) = a, J{p) = b. 

When r = 0, the solution space to this differential equation is spanned by 1 and s. Taking the boimdary 
conditions into account, we see that the solution is 

6 — a 

J(s) = a -\ s. 

P 

Similarly, when r = —1, the solution space is spanned by coshs and sinlis, and the boundary conditions 
give 



b ~ a cosh p . 
a cosh s H — smn s. 



The index of each of these Jacobi fields is given by 



sinh p 



- rJ^ ds = J{p)j{p) - J(0) j(0), 



where the right-hand side is obtained from the left via integration by parts and the differential equation 
satisfied by J. Thus for r = 0, the index is 



P 



and for r = — 1, the index is 

h 



• 1 /, 1 \ coshp 

a smn p + ( — a cosn p) — - — 
smn p 



b — a cosh p 
sinh p 



b^) cothp- 2ab— 



sinh p 

2 P 
{a — b) coth p + 2ab tanh - . 



Putting this together, we see that 

(a + b) dWr 



dpr = 



(a + b) dWr 



p 



dr 



(a — 6) coth p + 2afe tanh i 



for r = 0, 
dr for r = —1. 



As mentioned, this holds until the first exit time from Em- Following the reasoning in [ , Section 6.6], one 
can show that Wr — — X]?=i(^i(^T), 7t(0))c?W^^ where 7r is the minimal geodesic joining Xt and j/r starting 
at Xr and running at unit speed. We will come back to these expression in Section 9. 

When the particles meet, we've achieved our goal, and we can either stop the process, or allow it to 
continue to run as x^- — Ut- Either way, there's no problem caused by the process hitting the diagonal. On 
the other hand, we do need to find a way to continue the process past the first hitting time of the cut locus. 
Showing that this is possible constitutes the proof of the following theorem. 

Theorem 9. Let M — {M, h) be a compact surface of constant curvature or —1, and let a — a{x, r) and b = b{y, t) 
he as above. Then there exists a process [xr^yr) on M x M, started from any (xq, j/o) ^ D and run until the first 
time hitting time of D, such that 

(1) The marginals Xr and are time-changed Brownian motions, with times changes given by a and b, respec- 
tively. 

(2) The distance (relative to h) between Xr and y-r, denoted p^, satisfies the SDE 



(25) 



dpr = 



(a + b) dWr + \ 
[a + b) dWr + 5 



p 



dr — Lt 



(a — b) coth p + 2ab tanh 



dr 



for r = 0, 
Lr for r = — 1, 



where Lr is a non-decreasing process which increases only when {xr^yr) G Cm (and the set of t for which 
(xTTyr) € Cm has measure zero almost surely). 
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Proof. As mentioned, the only issue is extending the constiuction mentioned above past the first hitting 
time of Cm- As usual, we proceed by approximation. 

Choose small, positive 6. Until the process comes within distance 6 of Cm (in the product metric on 
M X M), we run the mirror coupling as above. When the process hits distance 6 from Cm, at time n, we 
start to run Xt and yr as independent (time-changed) Browmian motions. This continues until the process 
is distance 26 from Cm, at time T2, when we again run them under the mirror coupling. We continue this 
procedure, so that we have a joint process {x^, y^) which evolves under the mirror coupling on intervals of 
time [t2„ ,T2n+i) and as independent processes on intervals of time [t|„_ ^ , t|„ ), for non-negative integers n, 
where the are the alternating hitting times of the S and 26 level sets of the distance from Cm- 

It's clear that and y^ are time-changed Brownian motions as desired, and that the satisfies the 
desired SDE when {y^ — x^)r is distance more than 26 from Cm- It's also clear that when x^ and y^ are 
being run independently, satisfies an SDE of the form 

dpi = u dWr + V dr — Lt, 

where u and v are bounded (with bound depending only on M and the bounds on a and 6) and Lt is 
a non-decreasing process which increases only when (x*, y^) G Cm (again, see the references mentioned 

above). 

Suppose we show that, for any t > 0, the expected amount of time on the interval [0, t] that the process 
spends within distance 26 of Cm goes to zero with 6. (Thus the amount of time the particles spend being 
run independently goes to zero almost surely.) Then letting 6 go to zero, we know there is at least one 
subsequence along which the process {xl,y^) converges to a limiting process {xr,yT) (by compactness). 
That this limiting process satisfies the first property in the theorem is immediate, since and yf do for all 
(5 > 0. For the second property, note that the contributions from the u dWr term and the v dr term go to 
zero by the boundedness of u and v and the fact that the expected length of time over which these terms 
are integrated goes to zero. It follows that the martingale part and the "regular" part of the drift come 
entirely from the SDE for p induced by the (mirror) coupling, and that the time spent at Cm has measure 
zero. Finally, the contribution converges to a term L-r as indicated. 

Thus, to complete the proof, we need only show that the expected amount of time on the interval [0, t] 
that the process {y^,x^)r spends within distance 26 of the Cm goes to zero with 6. This is the most tedious 
part, so we will tiy not to belabor it. Also, to simplify matters, we will take advantage of the structure of 
M. In the r = case, we can think of M as modulo some lattice, or equivalently, as a solid parallelogram 
with opposite sides identified. Then for fixed x, the cut locus is just the identified sides of the parallelo- 
gram (centered at x), which means that it is just two intersecting closed geodesies. This implies that Cm 
consists of two intersecting smooth (compact) hypersurfaces in M x M. The advantage here is that we can 
understand the time spent near Cm in terms of the time spent near each of these two hypersurfaces surfaces 
individually. This is nice because, in some (possibly local) tubular neighborhood of such a hypersurface, 
the (signed) distance to the hypersurface is smooth. 

The r = — 1 case is analogous. Here M is the h5^erbolic plane modulo an appropriate group action, or 
equivalently, a polygon in the hyperbolic plane with sides identified. So for fixed x, the cut locus is the 
union of a finite number of intersecting closed geodesies, and Cm consists of a finite number of intersecting 
smooth (compact) h5^ersurfaces in M x M. 

In either case, it's clearly enough to consider one smooth hypersiirface component of Cm, which we 
denote E, and prove that the time spent within distance 26 goes to zero. We fix some 6o > such that 
the signed distance from S, which we denote ^, is smooth on a neighborhood of the closure of the (Jo- 
neighborhood of E. (We will later also assume that 6o is small enough to satisfy one other condition.) 
Now the SDE satisfied by = ^t, for |^| < 6o, depends on whether the joint process is being run under 
independence or under the mirror coupling (of course, depends on 6, but we'U suppress this for ease of 
notation). 

Under independence, the bounded geometry and non-positive curvature of M implies that satisfies 
an SDE of the form 

d^r = u dWr + vdT ioT \^\ < Sq, 



22 



NEEL AND POPESCU 




Pt 

Figure 1. The configuration of the joint process relative to the cut locus. 



where u and | v \ are bounded and u is bounded from below by a positive constant (with all of these bounds 
depending only on M, So, and the bounds on a and h). Under the mirror coupling, ^t- satisfies an SDE of the 
form 

d^T = u dWr +vdT for < |^| < 6q. 

(Note that the process never runs under the mirror coupling when ^ = 0.) Here, the bounded geometry 
and non-positive curvature of AI again implies that u and \v\ are bounded with bounds depending only 
on M, Sq, and the bounds on a and b. We further claim that u is also bounded from below by a positive 
constant depending only on M, Sq, and the bounds on a and b. This is the key fact, and it uses the particular 
structure of the coupling. We now establish this claim. 

It's easiest to visualize what's happening by looking at a horizontal slice of M x M. The instantaneous 
picture is given by Figure 1, which illustrates the mirror map for Brownian differentials obtained from 
{dW^ , dW^) by reflection and the minimal geodesies from y to x and the slice of E (which we denote Eo 
and which is a piece of Cut^;), from which the gradient of ^ and its interaction with the generator of the 
process are determined. Let A be the distance from y to Sq. By symmetry under interchanging x and y, the 
symmetry of being in the cut locus of a point, and the product structure on M x M, we see that ^ = ±X/^/2, 
with the sign coming form the fact that ^ is a signed distance. 

Since the martingale part of dS^r depends only on the first order structure at a point, we see that we 
can consider the horizontal and vertical components separately (and that the martingale part is the same, 
infinitesimally for both the K = and the K = —1 cases). If we let (p be the angle between the minimal 
geodesic from a; to y and the minimal geodesic from y to Sg, then the martingale part of A relative to the 
evolution of yr (with x temporarily fixed) is — b cos (pdW^ — b sin ipdW^. As we see from symmetry, the 
martingale part of A relative to the evolution of Xt is —a cos ip dW^ + a sin ip dW^. Combing these, we see 
that the martingale part of d^t is 



±^ [-{a + b) cosipdW^ + (a - 6) sinpdW^] , 
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and thus, using that and are independent, we have 

u = {a + bY cos^ (y9 + (a — fe)^ sin^ ip 

for an appropriate choice of W in the SDE satisfied by ^r- 

Next, we need to show that Lp is bounded away from 7r/2 (on the set {0 < |^| < <^o})/ with the bound 
only depending on M and 5q. To do this, consider the behavior of ip as we let j/r approach a point in Sq. In 
the limit, the geodesic from to yr cannot be tangent to Sq, since for surfaces of non-positive curvature 
the geodesies from x are transverse to Culx- Thus, the corresponding limit of Lp is strictly less than 7r/2. By 
compactness (and continuity of ip), all such limiting values of p are bounded away from 7r/2, with bound 
depending only on the geometry of M. Now p varies continuously as approaches any point in Sg, so 
again by compactness, p will be bounded away from 7r/2 on any sufficiently small neighborhood of Eg. 
Thus, by assuming that 6^ is small enough (this is the other condition on (5o mentioned above), we see 
that p is bounded away from 7r/2 (on the set {0 < \S\ < Sq}), with the bound only depending on AI and 
Sq. Continuing, since a and b are bounded below by a positive constant, the claim that u is bounded from 
below follows. 

Thus the SDE satisfied by switches between these two possibilities at the stopping times . Using 
the bounds on the coefficients of the SDEs just given, it is a standard exercise in stochastic analysis to show 
that the time, over t G [0,t], that spends in the interval [—2(5, 2S] goes to zero with S almost surely, and 
we omit the details. As noted, this completes the proof. □ 

7. Convergence of first order to constant curvature in the case x{M) = 

Now that we have our uniqueness / verification theorem and the general coupling procedure, we begin 
exploring some of the consequences. As usual, for simplicity, we assume that we have a smooth solution 
for all time t > on the manifold M. We take here a flat metric h, which is possible under the assumption 
that x{M) = 0. 

The main result of this section is the following. 

Theorem 10. For M, h, and po cis above, suppose that we have a smooth solution to Equation (9) for all i e [0, oo). 
Then there exist constants, c, C > which depend only on the metrics go and h such that 

(26) sup |ft(a:)| < ce-*^*. 

xeM 

Proof. Fix a time t > 0, a time s e [0, t) and a point x e M so that the Ricci flow has a solution on [0, t]. 
The first thing to notice is that pr = pt-rixr) is a martingale. Thus we have the following stochastic 
representation 

(27) p,ix)^E[p,_^ix,)] 

valid for any stopping time a with < a < t. In particular, setting a ^ t shows that p^ [x) is a weighted 
average of the values of pq. Thus 

(28) minpo < minp, < maxp, < maxpg 

M M M M 

for any t. The main idea for getting (26) is to prove that for some c, C > 0, 

(29) oscpt < ce~". 

Indeed, if this is true, then combining this with the fact that the integral of e^^* with respect to the volume 
induced by h is 1, we deduce that there is at least one point x for which pt {x) = and from here it is clear 
that we get (26). 

We now choose any two starting points x and y for the processes x^ and y^. Over each of these points, 
there's exactly one point (j>i{x) and Pi{y)) in the fiber which is in the reachable set Ff. We wish to run the 
controlled process starting from both {x.pi{x)) and {y,p^{y)), and couple them so that they meet as quickly 
as possible. Our reachable sets have the semigroup property, i.e. the process [xr^Pr) at time t e [0, t] is on 
Vt-Tf and since we know that we have a solution until time t, we know that after running the controlled 
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processes for time t < t they will be on the solution section corresponding to the Ricci flow at time t — t. 
This means that if the particles couple on M, they couple in the total space as well, that is, Xr = Ut implies 
i\\aip^_^{xr) = Pt-Ayr) as well. 

In light of this, if a is the coupling time of and y^, the martingale property gives that 

pt{x) -Pt{y) = ¥.[pt^„/\s[Xcr.s)] ~ E[ft-<TAs(y<TAs)] 

(30) ^E[pt-a{x„) -pt-a{ya),Cr < s] + E[pt-siXs) -pt-s{ys),S < a] 

= HPt-s{xs) -Pt-s{ys),s < cr]. 

The outcome of this is that 

(31) oscpt < P(s < (t) oscpt-s- 

What remains to be controlled here is P(s < a). While the above is true for any coupling of Xr and y^, 
we wish to use the mirror coupling, as was introduced in the previous section. The main property of 
this coupling, for us, is contained in (25) which gives the equation satisfied by the distance function = 
d{xr,yT), namely 

(32) dpr = (a + b) dWr + —(a-bfdr- Lr 

with a,- = e~P''^^^^^\ br — e"*'*"^'^^-' and being a one dimensional Brownian motion on the time interval 
[0,t]. Obviously the time t runs up to cr (the hitting time of 0) or t, whichever comes first and the term Lr is 
non-negative. We are interested in estimating the probability this hitting time a occurs after time s. To this 
end, the first thing which will be used here is the fact that from (28) we know that a and b are all bounded 
from above as well from below. So we have two constants A,B>Q which are depending only on pq, or 
otherwise the starting metric go, with the property that 

(33) A<a,b<B. 
To move on, we let 

r 1 

be the time-change making the martingale part of pr from (32) into a Brownian motion. Then with the 
notation p„ = px^u), 

(34) dp„ ^dWu + ^ ^" 7 ^2 - 

2pu [a + by 

where a and b are evaluated at time X{u) and the above equation is valid for u E [Q,tAX^^{t)), where A^^(t) 
is the first value of u corresponding to A(u) = t. Obviously cu < X{u) < Cu for some constants c, C > and 
also because of (33), 

a-b B- A 

r < 7^ 7 = l-<5< 1. 

a + b ~ B + A 

Ignoring the L term in (34) and then using standard comparison for ordinary stochastic differential equa- 
tions, we learn that the process p is bounded above by a Bessel process of index S < 2 and starting at some 
value po bounded by the diameter (with respect to the metric h) of the manifold M. Thus, invoking [ , 
Equation (15)] which gives the distribution of the hitting time (t of for a Bessel process of index S < 2 
starting at p, we obtain 

Finally, since cu < X{u) < Cu and the diameter of the manifold M is finite, we arrive at 
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where D is a constant which depends only on the initial metric go and some geometry of the underlying 
metric h (more precisely the diameter of M with respect to h). Hence it turns out that the function A is 
determined by the metrics h and 50- 

To summarize, from (31) and the preceding we now have that 

oscpt < A{s)oscpt-s. 

Using this, it is easy to get (29) as follows. For t e [0, 1], we know from (28), that oacpt < oscpo- Now for 
each t E [n, n + 1], n > 1, using repeatedly the above inequality, we arrive at 

oscpt < A(l)"oscpt_„ < A(l)*oscpo/A(l) 

which is exactly the exponential decay of (29) since < A(l) < 1. □ 

Remark. It is interesting to point out that we can prove the same exponential decay as in Theorem 10 for the case of 
x(A/) < using the coupling argument. This decay is, however, already taken care of by the a priori estimates of 
Corollary 7. Nonetheless, this coupling argument is the one we will employ for the gradient estimates in the following 
section. 



8. Estimates on the gradient decay of the normalized Ricci flow in the case x(A/) < 

We continue under the same assumptions that AI is a compact surface with reference metric h of constant 
curvature or —1 (so M has non-positive Euler characteristic by the Gauss-Bonnet theorem) and go is a 
smooth initial metric in the same conformal class and with the same area as h, so that the normalized 
Ricci flow has a smooth solution for all time which is given by pt- Now, pt converges in the C°-norm 
exponentially fast to as shown in Corollary 7 for the case x{M) < and Theorem 10 for the case x(M) = 0. 
So we have that for some constants c, C > 0, 

(35) sup |ft(a:)| < ce-*^*. 

Let 

Gt = sup \\7pt{x)\ 

The idea is to start with 

/V7- . N c\ r Ptilhix)) - ptix) 
(Vpt(a;),0 = lim 7 

n— >0 II 

where ^ is a unit vector in the tangent space at x and jt (x) is any curve started at x with initial speed ^. Then 
we use the coupling to estimate pt (x) — pt (y) for x and y close to one another Due to the non-linearity of 
the flow, the estimates coming from the above will still contain the gradient bounds, but in the end, letting 
X and y come close to one another leads to a functional inequality on Gt, from which we are able to derive 
the desired estimate. 

Theorem 11. If x(M) < then Gt goes to exponentially fast. As a consequence, pt converges to exponentially 
fast in C^. 

Proof. Pick two sufficiently close points x,y E M and some t > 0, and let pr = d{xr,yT) for < r < t be the 
distance (measured with respect to the time independent metric h) between the processes Xr and yr started 
at X, and y respectively. We are going to use mirror coupling for the processes x. and y. . Recall that the 
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equations coupling equations satisfied by {xt.Pt) and {yTjqr) are given by 

2 

e 



(36) 



dVr 
dpr 
dqr 



-Pr 



.i=l 
2 

Y,z,{yr)V2 o dWl 



1=1 

2 



,i=l 
2 



1=1 



l)dT 



l)dT, 



where r = or — 1 and W is the Brownian motion given by the mirror coupling. 

We consider a, the coupling time of x. and y.. From the fact that Pr + t /q^(1 — e~^P")c?u is a martingale 
and Pr — Pt-Tixr)/ we write. 



(37) 



Pt{x) -Pt{y) = E[ptAr - gtAT 



rE 



t/\T 



-2g„ 



)du 



for any stopping time r. The useful estimates we are interested in are estunates from above of pt(a;) —pt{y), 
and this is good if we assume that pt{x) — pt{y) > 0. This is always possible unless pt is constant in which 
case the gradient is 0, so there is nothing to prove then. Thus, assume that pt (x) —pt{y) > for some points 
X and y (which is the same as po > qq) and take a to be the first time u for which p„ = g„. With this choice 
of the stopping time, for any u e [0, a] we know that p„ > Qu, which thus means e^^P" — e^^^" < 0. This 
combined with the crucial fact that r < and the exponential decay of pt, implies that for any s £ [0, t], 

pt{x) - pt{y) < E[pc -qa,a<s]+ E[ps -qs..s<a]< ce^'^(*~^)p(s < a). 
The point is that if a is the first coupling time, of the processes x. and y., it is obvious that a < a and thus 

P(s < a) < P(s < (t) for any s e [0, t]. 

which in turn yields 

(38) pt{x)-pt{y) < ce-^^*~'''>¥{s < a) for any s e [0,^]. 

With this equation our next task becomes the estimate of P(s < a). 
From Theorem 9 we learn that the distance process pr satisfies 



(39) dpr < (e^f 
in the case r = and 

(40) dpr < [e-P^ + e-'i^)dBr 



-)dBt 



dT 



1 r 

2 



e"''")^coth/9^ + 2e" 



tanh - 



dT 



in the case r = — 1. Here Bt is a one dimensional Brownian motion run in the time interval [0, t]. 

So far, we have used this strategy of coupling in the proof of Theorem 10, in which, due to the singularity 
in the drift of the equations (39) and (40), we compared the distance function pr with a Bessel process. For 
the gradient estunates, we are going to remove the singularity based on the observation that 

Pt = Pt-r{xr) and similarly = pt-riyr)- 

The upshot of this is that the term e^^^ — e~'^ is in fact of order pr- More precisely, due to the boundedness 
of p. 



,^-p,-Av.)\ < CdiXr,yr) sup \Vpt-r{x)\ = CGt-rPr- 

xeM 
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Since pr < D, where D is the diameter of M, it is straightforward to show that either of (39) and (40) 
implies 

dpr < {e^"^ + e-''^)dBr + C(l + Gl_^)prdT. 
To go further from here, consider pT the solution to 

dpr - (e-P- + e-'>-)dBr + C(l + Gl_^)prdT. 

with the same initial condition po = d{x^ y) as Pr- Standard arguments (in fact a simple application of 
Gronwall's Lemma) give that 

Pt < Pt 

which results in the fact that the first hitting time of for p is less then or equal to the first hitting time of 
for p. Now if (7 denotes the hitting time of for the process pt 

(41) P(s < ct) < P(s < ct) for all s e [0,i]. 

Therefore the task now is to estimate the latter, and to do this we solve for p as 

with the notation /(t) = C(l + Gl_r), for < r < i. Consequently the first hitting time of for p is the 
first hitting time of —po for the time-changed Brownian motion /Q^(e~P" + e^'^^)e~ ■f^^^'^^dB^. In law, this 
is the same as the first hitting time of — po of i3c(T)/ with the time change 



c(t)= / {e^P" +e-i^-fe-^^o f'-^'^'^'dv. 
Jo 

Once again using the boundedness of p, we can find a constant C > such that 

c(t) > c'(t) C / e-^^o f^'^^'^dv for r e [0, t] 



Now, if (T-pg is the first hitting time of — po for the Brownian motion, then the hitting time of — po for 
Bc(t) is given by c~^((T_po A c{t)). This combined with (41) yields that 

(42) P(s < i A a) - P(s < c-i(a_,„ A c(t))) < P(c(s) < (7_pJ < P(c'(s) < a_p„). 

The distribution of (J-pg is actually well understood (see for instance the Remark after Proposition 3.7 in 
[21]), and its density is given by ^P" ^^ g-Po/i^^) on the positive axis, which results in 



2Trx-^ v27r Jo 

Going back to (38) and using the preceding, we conclude that for s e [0, i]. 



c'is) < a_pg) = / -^e-Po/(^^^dx = ^ / e-^'-/'dr. 
Jc'{s) y2T:x^ v27r Jq 



{x)-Pt{y) < ce-^'*~^) / e-^ '^dT. 



Pt 

from which, using the fact that d{x, y) — po and letting po go to 0, we fairly easily deduce that 

which we rearrange as 

At I e" ^0 At-^du^^ < ^g-ct fgj. ^ g t] with = CtGl. 
Jo 

From here the exponential decay of At is taken care of by the following Lemma. □ 
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Lemma 12. Suppose At is non-negative, depends continuously on t for t > and has the property that for some 
constants c, C > 0, 

(43) At [ e- At-^du^^ < ce^^^*-") for all s e [0, t] . 

Jo 

Then there are constants k,K > such that 

At < Ke~''*forallt > 0. 

Proof. For each n > 1, let m„ ~ s,npte[n,n+i] ^^id Af„ = s'^Pte[n-i.n+i] ^t- Notice that the exponential 
decay we are looking for is actually equivalent to to„ < Ke^'^" for large enough n. 

Now, for t e [n, n + 1] and s e [0, 1], we have t — s € [n — 1, n + 1] and therefore —At-u > —Mn, which 
combined with (43) yields, for t near the supremum of At on [n,n + 1], and eventually another constant 

c> 

nin / e^'^*^"(iT ~ rrin < e~™ for all large n. 

Jo 

which in turn gives 

1 — e 

Now, for each particular n, we have one of the following two alternatives: 

(1) Mn < e^^"/^, in which case it is clear that 

(#) ™„ < e'™/2. 

(2) il/„ > e^™/^, and in this case 1 — e^*^" > 1 — e^"^ ""^^ > ^e^"^/'^ for large enough n, say n > hq. 
From (**), it follows that rn„ < 2Af„e~'^"/^ < MnC^'^^^ for all n large enough, say n > ni. This 
inequality implies that 

(##) m„ < m„„ie~'^/^ for all n > rii. 

Indeed if the supremum of At on the interval [n — 1, n + 1] is the same as the supremum on [n, n + 1], 
then AI„ = rrin and this in turn implies A/„ = 0, in particular we trivially have (##)■ If the 
supremum of on [n — 1, n + 1] is the same as the supremum on [a — 1, n], this gives Af„ = m„_i 
and then (**) gives (##). 

Using these two alternatives we argue as follows. Assume that there is a large enough n2 such that 
< e^'="^/2^ Then an easy induction using the two alternatives above give that m„ < e-'="/2 for all 
n > n2- If there is no such n2, this mean that for all n > ni we clearly have the second alternative and in 
this case m„ < mn^e"""^/^. In both cases we obtain the exponential decay we were looking for. 

An alternative proof can be given as follows. Take a sufficiently large constant K > 0, which will be 
chosen later. Now we look at Bt = Ate''*. Assume there is a time t > K such that Bt is the maximum over 
the time interval [0, t]. We then have Ar < Ate^'''^ for r e [0, t] and from (43) with s = 1, 

At /'e-^^'^'^'^'dr <ce-'^* 



-Ate'' ^ k -Ct 

* < ce e 



and from this 

1 

which gives that 

At < -e"'^'log(l - ce'^e-'^*). 
If we choose the constant K large enough and k small enough, so that 1/2 < 1 — ce^e^'~^^ , then we arrive at 

At < ce-^' < Ce-''*. 

In particular this means that Ate''* < C. As this Bt is the maximum of Br over r e [0,t], we get that 
At < Ce~''*. The other alternative which remains is that there is no < > iiT for which Bt attains a maximum 
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on [0, t] for t > K. In this case we deduce that Bt has a maximum for t e [0, K] and the exponential decay 
follows again. □. 

Remark. As one can see we do not fully use the condition in (43) for all s G [0, t]. It suffices to have this hold for 
s € [0,1] and large enough t. 

Before we close this section let us point out that the exponential decay of the gradient has the following 
consequence that we will use later on for the estimates of the higher order derivatives. 



Corollary 13. Under the same assumptions as in Theorem 11, 




Proof. This follows by combining (41), (42), and the fact that c'(s) is bounded (due to the gradient estimate). 

□ 



9. Triple Coupling 

9.1. Basic idea. We have just used coupling to prove the exponential convergence of p to in the C^- 
topology. The next step in our analysis is the estimate of the decay of the Hessian of p, which, from the Ricci 
flow equation, implies the convergence of the curvature to a constant. The basic idea starts with writing 

where C is a unit vector at z, and 7 is a geodesic running at imit speed started (at t = 0) at z with velocity 
^. Now we are concerned with three points, x — ^{—po), y = "yipo), and the middle point z. As in the 
gradient estimate case, we want to write p{x), p{y), and p{z) as integrals of some functions of the associated 
Brownian motions and then use probabilistic estimates to find bounds for Ptini^Po)) — '^Pt{z) + Pt{l{po)) 
in terms of po- 

There is very little literature on this idea, though it certainly seems that this probabilistic tool is quite 
useful for estimating second-order derivatives for evolution equations. The only reference to this approach 
we're aware of is in [ ], where it is essentially used to estimate the Hessian of harmonic functions on 
Euclidean domains. 

To make this idea more precise, we will develop a mechanism of triple coupling (that is, a coupling of 
three particles, as opposed to just two). We will use mirror coupling for the processes corresponding to the 
particles x and y, taking them as time changed Brownian motions, as in the previous section. Now we wish 
to include a third particle, namely z, which we want to couple together with x and y. It is natural to want 
to have this "middle particle" remain on the geodesic joining the other two. We will see that this is possible 
(at least in the cases we're considering) if we allow it to evolve as time-changed Brownian motion, possibly 
with drift along the direction of the geodesic. 

Instead of starting with a time-changed Brownian motion with drift, Zr and then trying to figure out the 
time change and drift necessary so that it stays on the geodesic, we do it the other way around. Namely, 
since we want the particle z^ to move on the geodesic, we determine the conditions on the distance to one 
of the other points so that the corresponding point on the geodesic is a time-changed Brownian motion with 
a drift along the geodesic. For the purpose of the Hessian estimates, and in light of the gradient decay, this 
will be sufficient. 

9.2. Rigorous Approach. Assume we start with an arbitrary Riemannian surface M and that Xr, yr run 
as time-changed Brownian motions with the time changes a and 6, as above. The idea is that the middle 
point Zr on the geodesic joining Xr and y^ is completely described by specifying the distance pi from z^ 
to one of the ends, say x^- We use a mirror coupling of the particles Xr and yr and pi ^ will be described in 
terms of a real-valued SDE. In addition to pi, we will also consider p2, which in intuitive terms is just the 
distance from the middle particle Zr to yr- We are seeking several key symmetry properties which will play 
an important role in the economy of the Hessian estimates to follow. 
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In what follows, as always, fix a time horizon t > 0, and assume that a — a(T,x,y, pi, P2) and b = 
6(t, X, y, pi, P2) are two positive functions defined on [0, x M x M x [0, 00) x [0, 00), which will be time 
changes for the processes Xr and y^-- To describe this, again denote by m^.y : T^M — > TyM the mirror map, 
that is, the parallel transport along the minimal unit speed geodesic jx,y joining x and y (assuming that x, 
y are not at each other's cut locus) followed by the reflection about the orthogonal direction to the geodesic 
aty. 

The system we start with is the following 



(45) 



dXr = a(T) 



dyr = Ht) 



Y,^r[z,{y,)] odWl 



2 



dp^^r = -a(T) ^{u{Xr)nr{'^))dWl + a{T)dW^ + (3{T)dT 
1=1 

2 

dp2,r = b{T)J2{'i'Ac^{yr)],ir{l{T)))dW:. + a{T)dW^ + P{T)dT, 



where = rrix^ ,y^t{xr)t{yr) ^ is the reflection map acting on Ty^, 7,- is the minimal geodesic running at 
unit speed from Xr to yr, and is a one-dimensional Brownian motion independent of (Vt^^, W'^). As a 
notation, let Z(t) be the length of the geodesic 7t. Here we do not specify what the functions a, a, (3, (3 are as 
we will do this along the way, depending on the properties we want to reveal. They are defined, like a and 
b, on [0, t]x M X M x [0, 00) x [0, 00). The equations for pi and p2 can be thought of as the equations of the 
distances from the middle point Zr to Xr and yr, as indicated in the previous section, and also as discussed 
for the coupling in [ 15, Section 6.6]. Notice here an important point- since 

{^Myr),ir{l{T))) = -(e,(a;O,7,(0)), 

the last equation of (45) can be rewritten as 



(46) 



dp2.-r 



2 

i=i 



There is no problem with the existence of a solution for the system (45) (as long as the entries a, 6, a, /?, a, 
and (3 are smooth) up to the stopping time T, which is the first time t when pi^rp2,T hits or when d{xr,yT) 
hits a (small) rg smaller than the injectivity radius. This way we have a well-defined system and do not 
have to worry about the extension beyond the cut locus, as we did in the previous (two particle) coupling 
case. From now on, during this section we will assume that the time in the system (45) is run until T. 

The object of interest to us is the process {x,y, pi, p2)- It is clear that this is a diffusion, and it is a relatively 
straightforward task to determine that the generator of {x, y, pi, P2) is 



— A,„ 
2 " 



-a 



-di 



ab{mxyXi^i,Y2^i)XiiY2. 



- a'^{Xi^i,'jx,y{0))Xi^,dp^ - ab{Xi^i,jx,y{0))Xi^idp^ 

~ ab{mx,yXi^i,jx.,y{0))Xi,idp^ - b'^{mx,yXi^i,jx.,y{0))Xi.idp^ 
2 

+ {aa-abJ2{Xl,^,ixA0)f)^p^^p,+(3^p,+^^p„ 



with Xi,i, i — 1,2 being an orthonormal basis of T^M and Y2J, j — 1,2 an orthonormal basis of TyM. In 
fact, we can choose Xi^i = jx.y{0) and Xi^2 = G T^M, which is perpendicular to 7x,y(0). Similarly, 
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choose ^2,1 ~ iy.x^) and 1^2.2 = ^2 = fnx.yS,!, or said simply, the parallel transport of along the geodesic 
7x,y With these choices, the generator simplifies to 



+ „2 + o2 



= y A, + -Ay + -^^dj,^ + + aK7.,y(0)%,.(0) + 66) 

(47) - a^ix,y{Q)dp^ - a6%,:r(0)9pi 

- a67a;,a(0)ap2 - fc^7y,x(0)9p2 

+ (aci — ab)dp^dp^ + (3dp^ + $dp^. 

The first property we want to see is that pi+ p2 — p. This property is nothing but the geometric picture 
that pi is the distance from Zr to Xr while p2 is the distance between to Ur- 

To do this we recall that the distance pr between the mirror-coupled processes Xr and is given by 

(48) dpr = -(a(r) + 6(t)) ^(e,(a;,), >(0))dW^; + -I(T)dr 

where T is the index form of the Jacobi field J{t) along the geodesic 7r which, at the endpornts, has values 
aE and hE. We use the notation E for the parallel translation of 6 G T^M along the geodesic joining x and 
y. The index form is computed as 

I(J, J) = / I + (i?(7(u), JH)7H, JN)rfM, 



with /(7) being the length of the geodesic 7. Here the curvature tensor is the standard tensor curvature 
given as in [5] 

R{X, Y) = Vx Vy - Vy - V[x,y] • 

On the other hand, from (45), 

2 

d{pi,r + P2,r) - -(a(r) + fe(r)) ^(e,(x,), 7r (0))dVK; + (^(t) + 5(r))dVK3 + (/3(r) + ^(T))dT. 

1=1 

We clearly see here that pr and pi ^ + /02,t have the same martingale part if a = —a. The choice for (3 and /3 
is provided by the following result. 

Theorem 14. Assume that 
(49) 



a — —a 



/3(t, X, y, pi, P2) - i C l-^WI" + (^(tN. '/("))7("), 
1 rKl) 



[P{T,x,y,p^,P2)^\!li:i{_^JJ{u)\^ + {R{i{u),J{umu),J{u))d^ 

where J is the Jacobi field along the geodesic jfrom x to y and having values aE at and bE at 
If in addition, pi^ = p2,o = Po/2, then almost surely p^ — pi^r + P2,t- 

Proof. Take pi^r = Pr — P2,t- It is clear now that we have 

rPi.T rpi,T 
•^(pi.T — Pi,t) = / A[u)du — / A{u)du 
Jq Jo 

with 

M^) = I [\j{u)f + {R{J{u),j{umu),J{u))du 

From here, the fact that pi^ = pi^ (or pi — pi,o — 0) and standard application of Gronwall's inequality 
leads to pi,r = pi.r, which is what we want. □ 
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We return now to the case where the curvature is constant and start with [4, Lemma 3.4] which says that 

(50) R{X,Y)Z = -r{{X,Z)Y - {Y,Z)X). 

We should point out that the Do Carmo [4], takes the curvature to be given by the negative of the curvature 
we consider here, or for that matter other people. Then the Jacobi field equation becomes 

J-i?(7,J)7 = 

or equivalently, 

(51) J + rJ- r(7, J)7 = 0. 
Since this Jacobi field is perpendicular to the geodesic, it follows that 

'j + rJ^ 
J(0) = aE 
{j{l{^))^hE. 

The solution is 

(52) J(s) = {awi{s) + bw2{s))E{s) for s £ [0, ^(7)] 
where wi,W2 are defined on the interval [0, ^(7)] by the following odes 

{wi + rwi =0 I + = 

wi(0) = 1 and i ^2(0) = 

An integration by parts argument together with the equation of the Jacobi field and the constant curva- 
ture assumption reveals that 



\J{u)\^ - {R{J{u),j{u))j{u),J{u))du^ / \J{u)\'' ~r\J{u)\^du 

Jo 

= ( j(s), J(s)) - ( j(0), J(0)) = (awi(s) + bw2{s)){awi{s) + bw2{s)) - a{awi{0) + 6^2(0)) 
and 

- {RiJ{u),j{um{u), J{u))du = (J(/(7)), J(/(7))) - ( j(,s), J{s)) 

= a(awi(/(7)) + bw2{l{'^))) - (awi(s) + bw2{s)){awi{s) + bw2{s)). 

A direct consequence of these formulae and the fact that ^2(5) = ?«i(^(7) — s), plus a few elementary 
manipulations, results in 

1(1) 

+ aw2{s)){bwi{s) + aw2{s)) - a{bwi{0) + aw2{0))). 

lif)-s 

Summarizing, the choices of /3 and f3 from (49) in the case of constant curvature become more explicitly: 

/? = i {{awi{pi) + bw2{pi)){awi{pi) + 6^2 (pi)) - a{awi{0) + 6^2(0))) 
/3 = i {{bwi{p2) + aw2{p2)){bwi{p2) + aw2{p2)) ~ a{bwi{{)) + 0^2(0)))) . 



(54) 



It goes without saying that here a and b are evaluated at (r, x,y, pi, p2)- 

We say that a function /(r, x, y, pi, P2) is symmetric in pi and p2 if /(t, a;, y, pi, P2) = /(''', 2;, V, P2,Pi)- 
Before we move on to another property of the diffusion (x, y, Pi,p2), we close the discussion so far with 

the following property of the choices of (3 and f3 from (49) : 

(55) If a and b are equal and symmetric in pi and p2, then /3(r, x, y, pi, P2) — ${t, x, y, p2,pi)- 
A symmetry which plays a crucial role in the Hessian estimates is the following. 
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Theorem 15. If, in equation (45), we take 

a and a symmetric in pi and p2 
b — a 

a = —a 

i3{T, X, y, pi,p2) = P{t, X, y, P2,pi) 

Pl.O = P2,0, 

then the processes {x,y, pi, P2) and {x,y, p2: pi) have the same law. In particular, the processes {x,y,pi) and 
(x, y, P2) have the same law. 

Proof. Although this is almost trivial, we say a word about it. If C is the generator of a diffusion ujr on a 
manifold A4 and tt : ^ is such that for any smooth function (p : Ai ^ M., 

then imiqueness of the diffusion implies that w and Tr{Ld) have the same law. This can be easily seen from 
the martingale characterization of the law of the diffusion. We apply this to the operator C from (47) and 
the map n{x, y, pi,p2) — {x, y, P2,Pi)- The rest follows. □ 

Notice that (cf. (54)), the choices of /3 and j3 from Theorem 14 are actually consistent with the conditions 
of Theorem 15 under the assumptions that a and b are equal and symmetric. 
The "middle particle" process we are interested is 

(56) Zr = ■yx^,yAPhr)- 

The symmetry between pi and p2 should be interpreted as saying that the reflection of the process z^- 
with respect to the middle point of the geodesic jxr,y^ has the same law as itself. 

Our next objective is the law of Zt-. Before we jump into the heart of the matter, we take up a discussion 
on the following class of vector fields that play that are the main actors in our computation. 

Assume we have a geodesic 7 from a; to y with length I and consider a smooth, two-parameter geodesic 
perturbation / : (— e, e) x (— e, e) x [0, 1] — M of 7, i.e. /(O, 0, s) — 7(5) and for each fixed choice of u and v, 
the curve s f{u, v, s) is a geodesic. One of the things we want to imderstand is the field 

Let Jy{s) ~ -^f{u, V, s)\^_Q be the Jacobi field obtained by differentiating / with respect to v. Similarly 
let Ju{s) = '^f{'>J-TV,s)\^_^ be the Jacobi field obtained by differentiating / with respect to u. In order 
to determine the equation satisfied by /C, we recall here [4, Lemma 4.1], which asserts that for any two- 
parameter family g{a, b) and vector field V along g, 

(57) ^^V-^^V^^r(^,^]v. 

da db db da \ db ' da J 

Recall here that Do Carmo [ ] uses the curvature as the negative of what other people use, as for instance 
[5]. 

Now, what we want to do is find a differential equation satisfied by /C. As pointed out already, /C(s) = 
^Jv{s) and starting with (51) for J„, namely, 

-^Jv + rJv - r{'j, Jy)j = 
we take the derivative with respect to u to arrive at 

D ^ ^ / D . \ . , ,. ,. , , D 



34 NEEL AND POPESCU 

To move forward, use that j^ j— jg^l — Ju^o re-write the previous equation as 

duds^'^" + - r{IC,j)j - r{Ju, Jv)i ~ r{j, J^,) J„ = 0. 
Our task now is to commute the derivatives with respect to u and s. For this, use (57) and (50) to justify 



au as'^ as au as \ as du I ds du ds 



D D D f , ■ . I ■ . \ 

Now we once again employ (57), 

ds du ds " ds'^ du ^ ds \ \ ds ' du J ^ J ds i « " 

■■ D 

^ > -/C + r-((7, J„)J„- (J„, J,)7) 

ds 

= IC + r (^(7, jy)Ju + (7, Jv)ju - {Ju, Jv)i - {Ju, jv)i^ ■ 

Putting together (*) and (**) at u = 0, we obtain 

D D'^ ■■ / ■ ■ ■ ■ \ 

duds^'^'' = ^ + J' (^2(7, Jy)Ju + (7, Jv)Ju - {Ju, Jv)i - '2{Ju, Jv)ij , 

and finally, 

(58) it + rlC- fc(/C,7)7 + 2r ((7, j,) J„ - ( j„, J„)7 - (J„, j,)^^ = 0. 
The boimdary conditions imposed here are the natural ones 

The t5^ical example of the t5^e of perturbation /(u, s) that will appear below is given by the following. 
Take a geodesic 7 defined on [0, 1] and consider two geodesic curves, 771 (w) starting at 7(0) and another, 
ri2{v), started at 7(0- Then we take f{u, v, •) to be the geodesic run at unit speed from ?yi(u) to rj2{v). 

We discussed the case of a two-parameter perturbation of the geodesic 7 in the form /(m, f , s) but exactly 
the same argument works also for the case where f{u,s) is a perturbation with geodesies of 7, and we 
consider the field 

The main result from the argument above then gives that 

(59) A: + r/C-r(/C,7)7 + 2r((7,i,)J„-2(J„, j„)7) =0, 

with J„(s) = £/(u, s)\u=o- 

We are finally ready for the next result. 

Theorem 16. Assume that 

a{T,x,y,pi,p2) = a{T,x,y, pi, p2)wi{pi) + b{T,x,y, pi, p2)w2{pi) 

(60) ^' ^' = a;, y,Pi,P2) + r(^J (a(T, a;, pi, P2)wi(cr) + 6(t, a;, y, Pi, P2)w2{<j))'^da 

~T ("('^' 2^'^'^!' ^2)^1(0-) + 5(r, a;, y,pi,p2)u'2(o-))2(iCT 
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with wi and W2 defined by (53). With these choices, the process Zt = ^x.r,y^{pi,T) has the property that, for any 
smooth function ip on M, 



(61) 



is a martingale with respect to the filtration generated by W\, W2, and W3. Inside the integral, a{u) and 6{u) are 
shorthand for a and 9 evaluated at {u, a;„, j/„, pi^„, P2,m) In other words, z^ is a time-changed Brownian motion (with 
the time change given by a) with a drift in the geodesic direction from Xr to yr- 

Proof. The idea of the proof is to start with the generator of the diffusion (x, y, pi) and a function and look 
at the process ^p{zr)- More precisely, we find the bounded variation part of this. It is clear that, in terms of 
the generator (47), we need to compute the action of each term of this expression on Lp{'-^x.y{s))- Notice that 
the part which involves derivatives of p2 simply drops out in this calculation. 

For simplicity we will drop the dependence on r, x, and y in the notation and let I = d{x, y). Thus the 
geodesic ^x.y will appear as 7. Let E denote the parallel vector field along 7 which is obtained by parallel 
translation of ^1 . 

Now we take the terms one by one. Again for simplicity in writing, we will use s instead of pi as the 
parameter in the geodesic direction. 

(1) We write the Laplacian term as 

where 77^ 1 and 7711.2 are geodesies starting at x and having derivatives given by 770,1 = 7a:. ^(0) and 
^70, 2 = £,!■ Then we continue with 



Ax[(p{jx,y{s))] = 



Ax[ip{'yx,y{s))] = (Hess^(7(s))7(s),7(s)) + (Hess<^(7(s)) Ji(s), Ji(s)) 



(62) 



V¥'(7(s)),^7r,„,i,j/('S) 



L>2 



where Ji is the Jacobi field along 7 given by Ji(s) = 3^7r;„,2,y('S)l«=0/ which can also be character- 
ized as the Jacobi field with the boundary conditions Ji(0) — ^1 and Ji{l) — 0. On the other hand, 

if welookat/C(.s) = J[^7r;„,2,s/(s) 



, using (59) we obtain the equation satisfied by /C as 



ti=0 

a: + r/C - r(/C, 7)7 + 2r ((7, J,) J, - 2{J,, ^1)7) = 
/C(0) = 
JC{1)^0. 

Notice here that the boundary conditions follow from the fact that t;^ 2 is a geodesic and that 

7»)„,2,a(0 = y- 

Now, the Jacobi field Ji is given by 

Ms) = wiis)E{s) 
and this in turn gives the equation of K. as 

IC + r/C — r(/C, 7)7 = 4rwi7i;i7 



/C(0) 



-- 
0. 



We solve this as 



(63) 



/C = 7«i_o7 with 7«i o(s) = 2r / wl{a)da' 



2sr 

1^ 



wl{(T)da. 
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The conclusion is that 

(64) A,[(^(7,,^(s))] -(Hessvp(7(s))7(.s),7(.s)) +w;?(.s)(Hess¥>(7(s))£;(s),i?(s)) +z;;i,o(s)(V^(7(s)),7(s)>. 

(2) In the same vein, with very few changes, we can treat the next term, which is the Laplacian Ay 
applied to V9(7(s)). To this end, start with the following equation. 



(65) 



\Ml{s))] = (Hess^(7(s))J2(s), J2(s)) 



+ ( Vv3(7(s)),^7i;,»)„,i(*) 



M = 



u=0 



which is an analogue of (62) with the role of x being played hy y, rju.i a geodesic starting at y with 
initial speed given by 7(0/ and rju.2 a geodesic starting at y with initial speed ^2 = ^(0- Here J2 is 
the Jacobi field which is at and ^2 at L As in the case of Ji, we can show that 

J2is)=W2{s)E{s). 

The first term in the second line of (65) is because jx.j)^ 1 (s) does not depend on u. The second 
term can be dealt with in a similar way to that outlined above in dealing with the derivative of the 
Jacobi field. We skip the details and give the main result. Let 



ti=0 



(66) 



From (59), the equation satisfied by /C (with W2 given by (53)) is 

2sr 



K. = W0.17 with ?«o,i(s) = 2?' / 

Jo 



I 



wl{(T)da. 



Then we have 

(67) A,[^(7,,,(s))] =(Hess^(7(s))7(s),7(s)) +zi;2(s)(Hessv9(7(s))i?(s),i?(s)) +u;o,i(s)(V^(7(s)),7(s))- 



(3) For the next term, matters are fairly simple. Namely, because we are differentiating with respect to 
the geodesic parameter s. 



(68) 



(4) Next in line is 



(69) 



7a:, 1/ 

(0)7,,,(0)[(^(7.,,(s))]=0. 



This produces because 

%.a;(0)[(^(7x,y(s))] = 0, 

which follows from the fact that perturbing y along a curve 2 iri the geodesic direction of '~fx,y 
yields that Jx,7j,,,2 (s) = Jx.,y{s), and thus is independent of u. 
(5) Now we deal with 

Ci6[¥'(7x,y(s))]- 

To this end, consider the geodesies 77^ 1 and 77^ 2 which start at x (respectively y) and have the tangent 
vectors ^1 (respectively ^2)- What we need to compute is 



D D 

du dv 

If we let 



= {Ressip{-fx,y{s))Ji{s),J2{s)) + { f{lx,y{s)), ——^n^ ,^r,^ 2(s) 



D D 



U — V — {i 
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from (58), we obtain 

JC + rJC — r{JC, 7)7 = 2r{wiW2 + 'W2Wi)j 
/C(0) =0 



2sr 



which we solve as 

(70) /C = z/;i_i7 with wi^i(s) = 2r / wi{(t)w2{(j)(t — / wi{a)w2{<T)da. 

Jo * Jo 

We conclude that 

(71) ^.U^ij^A^))] = wi{s)w2{s){Ressip{-/{s))Eis),E{s)) +w,A^^ 

(6) Next is the following 

7..,(0)a,[^(7.,,(s))]=7(0)(V^(7(s)),7(s)) 

= (Hess(p(7(s))7(s),7(s)) + (V^(7(s)), 7(5)). 

Therefore, since 7 is geodesic, 

(72) 7(0)9,[^(7.,,(s))] - (Hess¥.(7(s))7(s),7(s)). 

(7) Now, 

(73) %,,(0)93[(p(7x,,(s))] =0, 

as can be easily seen from the fact that perturbing y in the geodesic direction (say, along rjy) reveals 
that 72;,,,^ (s) = 7x,i/(s), so that the derivative with respect to v vanishes. 

(8) The last term is easy to deal with and gives 

(74) 9.[^(7.,,(s))] = (V^(7(s)),7(5)>. 

Putting together all the results from (64)-(74) we arrive at (we drop the subscripts x and y) 

- ^(Hess^(7(s))7(s),7(s)) + (^^i(^) + ^^2(-^))^ (RessMsms), E{s)} 

(75) 

P+ ■ 2 " ] 

A little simplification follows from 

a^wifi + b^WQ^i + 2abwi^i = 2r {a{s)wi{T) + h{s)w2{T)fdT -j J (a(s)wi(r) + b{s)w2iT))'^ dr^ 

which then gives the result of the theorem, for the choice of a as in (60). □ 

We close this section with the following result summarizing all of the important findings for the next 
section. 

Corollary 17. Assume that the entries of (45) satisfy 

' a is symmetric in pi and p2 

b = a 

a = —a \ li + rw — 

aiT,x,y,pi,p2) ^ a{T,x,y,pi,p2)w{pi) with < u;(0) = 1 
l3{T,x,y,pi,p2) = ^a'^{T,x,y,pi,p2){w{pi)w{pi) - w{0)) [w{d{x,y)) = 1. 

P{T,x,y,pi,p2) = W{t, x,y, pi, p2){w{p2)w{p2) - w{0)) 



(76) 



2 

.Pl.O = P2,0 = Po/2 



Then 
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(1) + p2,T — Pt almost surely. 

(2) The diffusions {xr,yT, Pi.r, P2,t) and {xr^Ur, P2,t,Pi,t) have the same law. In particular, {xr, VtiPi.t) and 
[xr^Ur, P2,t) have the same law. 

(3) If Zr = lx^,y-^ {pi,t), then foY any smooth function (p on M, 

is a martingale with respect to the filtration generated by Wi, W2, and W3, where 

pi / 2/ 



Zu), jx^,y^ (Phu))) ] du 



&{T,x,y,pi,p2) ^ l3{T,x,y,pi,p2) + ra {T,x,y,pi,p2)[ / w {(T)da - — r/ w {a)da 

A word is in place here. The statement of Theorem 15 requires the symmetry of a with respect to pi and 
P2. This is not satisfied by the choice in (76) for arbitrary pi and p2. However, because of the choice of f3 
and (3 and Theorem 14, we know that (almost surely) pi r + P2.T = Pr - So it suffices to ensure the symmetry 
of a and a with respect to pi and p2 only in the case that pi + P2 ~ P ~ d{x, y), which follows from the fact 
that w{s) — w{d{x, y) — s) for s € [0, d{x, y)]. 

For a given I, the solution w to (76) is 

^ ' ' ' cos(Z^/f/2) 

In particular, for r = or — 1 and I bounded, w{s) is bounded and so are all its derivatives. 

10. Estimates on the Hessian decay for x{M) < 

For Euler characteristic less than or equal to 0, we know that pt and Wpt decay exponentially fast. Our 
goal is now to extend this to the Hessian of pt, resulting in the converge of the metric to the constant 
curvature metric in C^. In particular, the curvature converges to a constant. 

To estimate the Hessian decay, we proceed in a similar way to the estimation of the gradient, only that 
now we need to use a coupling procedure for three points rather than two. 

Let us denote, for t > 0, 

Ht = sup |Hesspt(x)|. 

What we want to show is that Ht decays to exponentially fast. 

Theorem 18. For the case xiM) < 0, Ht converges to exponentially fast as t — >• 00. 

Proof. To begin with, notice that 

(79) {Bessptiz)^, = lim j^^(7(^Po)) ^ 2p,(z) + ^,(7(^0)) 

Po->0 

where 7 is the unique geodesic passing through z and having the initial velocity given by ^. Thus, similarly 
to the case of the gradient estimate, we will use the three particle coupling to get a handle on the right-hand 
side of the above quantity, for sufficiently small po ■ 

For convenience, fix a time t > and let s E [0, 1 A t]. Pick two points x,y E M, with d{x, y) = po small 
enough, and let z be the middle point on the geodesic between x and y such that d{x, z) — d{z, y) — po/2. 
Consider the triple coupling described by (45) with the choices from Corollary 17. All the data there is 
completely described by the choice of the time change a of the processes Xt and yr- In this section we 
choose 

(80) a(T,x,y,pi,p2) = e-P'--(^-«) 

where Xx.y is the middle point on the geodesic between x and y. This choice does not depend on pi or p2, 
and consequently it is symmetric in pi and p2, as required by Corollary 17. Other choices are possible for 
the argument here, but we stick with this because it is symmetric with respect to x and y and makes some 
of the estimates look more natural. 
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Now, we consider Pt^aiza)/ where is defined in the previous section. Again rnvokuig Corollary 17, 
we learn that 

dpt-r{Zr) = Ml,r + {-dtPt-riZr) + a"^ (t) Apt-r{Zr) + 0{t) (Wpt-rir) , jr)) dr. 

where Mi t- is a martingale. From the Ricci flow equation, dtpt-rizr) — e^^^*"^*-^^' Ap(_T-(z7-) + r(l — 
g-2pt^T(zx)^ so we continue with 

(81) dpt-r{zr) = Mi,r + ((a'W - g-^^'-^^^^)) Apt-rizr) + 0(t) (Vft_, (t) , 7,) - r(l - e-2P'-(-^))) dr. 
For the semimartingale pt-rixr) we have from (45) and the Ricci flow equation that 

(82) dpt-r{xr) = M2,r + (g-^P'- ^ - 6"^^*-^ ^^^^ ^ ) Api_^ (.T^ ) - r(l - g-^P'-^^-)) dr 
where At- is the middle point of the geodesic joining Xr and yr- Similarly for pf_T.{y^), 

(83) dpt^riyr) = Ms.r + ((e-2p*-(^^) - e-^P'--^y-^)Apt-r{yr) " r{l - e-2pt-fe^)j dr. 
Now, putting these together, 

Pt-riXr) - 2pt-T[zT) +Pt-T{yT) = Pt[x) - 2pt{z) + Pt{y) + Mr 



(84) 



Apt-u{zu)j du 

e{u){Vpt-u{u),%)du 



where A/,- is a martingale. 

From the definition of a in Corollary 17 and the fact that we stop the processes before the distance 
between x and y hits some small number tq, it is not hard to prove that there is a constant C > such that 

\a{u) - a{u)\ < Cpid{x,y), 

which in turn, using the gradient decay estimates and the fact that An) < d{xu, yu) /2 = Pu/2, leads to 



-2pt_„(2„) 



< Cp,,u + |e-2P'--(^") - e-2p-*-"(A.)| < cpi + Ce-"p„ < Cp, 



Observe here that we do not need the full power of the exponential decay of the gradient. Just the bounded- 
ness suffices for this particular estimate, but used in conjunction with the definition of 6 from Corollary 17 
it justifies 



|6'(u)(Vpt_„(u),7„)| < cpi. 



-ct 



Finally, from the exponential decay of the gradient and elementary arguments. 



g-2pt_„(£!;„) _ g-2pt_„(2:„) 



and also 



-225t_„(i;„) 



-2j5t_„(A„) 



g-2pt_„(y„) _ g-2pt_„(2:„) 



g-2pt_„(3y„) _ g-2pt_„(A„) 



< CpuC 



< 



CPuC 



-Ct 



-Ct 



Now, let a be the first time u when pi „ or p2,u becomes 0, and let ( be the first time u when p„ hits tq, a 
small number (less than half of the injectivity radius). Replacing rhy t A a A ( in (84) and then taking the 
expectation at t = and t = s, combined with the above estimates, leads to 

\pt{x) - 2pt{z) +Pt{y)\ <\E[pt-sA.aAc{XsA.aAc) ~ 2pt-sAaAc{ZsAaAc) + Pt-sAaAciVsAaAi)]] 

(85) 









psAaAC 




/ Pudu 


+ cE 


/ PuHt-udu 




Jo 




Jo 
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Next, the stopping time ct is Ti A T2, where Ti and T2 are, respectively, the first time pi hits and the first 
time p2 hits 0. Now we can write 

t — s/\aA c(ysA<TAc)] 

+ ^[Pt-sArj{XsAa) " 2pt_sA<T (^sAtr ) + Pt-sAaiV sAa) , < C] 
(86) ^E[pt-SA.C{XSAC) - 2pt-sAc(^sAc) + Pt-sAc{ysA.c)X < 



The main point of this expresson is that, due to the symmetry with respect to pi and p2 from Corollary 17, 
we have the crucial cancellations 



(87) E[pt_Ti iVT, ) - Pt-T, {xtJ, Ti < < s a C] + E[Pt~T, {xt, ) - Pt-T, (yrj, T2 < Ti < s A C] - 



(88) E[pt-TAyT,)-Pt-TAxT,),Ti<s<T2AC]+E[pt-TAxT,)-Pt-TAyT,),T2<s<T,AC]^0. 
Furthermore, from the exponential decay of p and Vp, for any s e [0,t] we have 

\E[pt-sAcrAi;{XsAaAc) ~ 2pt_sAcr AC (^sActAC ) + Pt-sAtrAC (J/sActAC )] I 

<|IE[pt_sAc(a;sAc) ~ 2pt_sAc(2:sAc) +Pt-sAc(2/sAc)>C < + |E[Pt-s(a;s) - 2pt_s(2;3) s < ct < C]| 

<ce-"P(C < s A ct) + ce-^*E[ps,s < ct A C]- 
where we used the following inequalities 

\^\Pt-sAc{XsAc) - 2pt-sAc(^sAc) +Pt-sAc{ysAc)X < <^]\ 

< \E[pt-six,) - 2pt-s{z,)+pt-,iys),s < C < cr]| + \E[pt-c{H) - '2pt-c{zc)+Pt-dy<;)X<sA(7]\ 

< ce-^*E[ps, s < CT A C] + ce^'^*P(C < s A ct). 

Putting these together into (85), plus a little simplification, gives that for any s € [0, t] 

\pt{x) - 2ptiz) + pt{y)\ <ce-"P(C < s A ct) + ce-^'E[p,, s < ct A C] 



A further simplification is due to the symmetry with respect to pi and p2 from Corollary 17, which has the 
effect that 



The key step forward is the following result. 

Theorem 19. Let W^, W"^, and W"^ be three independent, one-dimensional Brownian motions, and let pi and p2 be 

two processes such that pi Q — p2fl ~ po > and 



+ E[pt-Ti{yTi) - Pt~TAxTi),Ti < T2 < S A C] +E[pt-T2{XT2) " ft-Ta (2/T2 ) , 72 < Ti < S A (] 
+ E[pt-Ti {yXi ) - Pt-Ti (xTi ) , Ti < S < Ts A C] + E[pt-T2 {xt2 ) - Pt^T2 (Z/Ta ) , 72 < s < Ti A C] 
+ E[pt-s{Xs) - 2pt-s{Zs) +Pt^s{ys),S < CT < C]. 



and also 




E[pu,u < CT A C] = 2E[pi_„, u < CT A C], 

and thus for s e [0, i], 

\pt{x) - 2pt{z) + pt{y)\ <Ce-^*P(C < s A ct) + Ce-^*E[pi,„ s < ct A C 

(89) , r.. , ... . ^ r„ 




(90) 1^ 
with Al + B'^ = 1. 



dpi^r = (1 + 0{pi,r)){ArdW^ + BrdW^) + (1 + 0{pi^r))dW^ + 0(l)dr 

dp2,r = (1 + 0{p2.r)){ArdWl + BrdW^) - (1 + 0{p2,r))dW^ + 0(l)dr 
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Let a be the first hitting time of Q for the process pip2 and Q the first time either pi or p2 hits some value tq. 
Assume that (90) fs valid for re [0, tr A Q, and in addition that for some constant C > 

(91) E[p2,s,s < ctAC] <CV.[pi^s,s<d ^C\ for alls e [0,1 At]. 
Then, there is a constant C > such that, for all s e [0, 1 A and sufficiently small pq > 0, 

(92) E[pi,s,s < a AC] <Cpl/^ 
and 

(93) ¥{C<sAa)<Cpl 

Proof. If we regard the process {pi^r, P2,r) as a process in the first quadrant, the equations in (90) give the 
property that near the axes the process behaves as a Brownian motion. 

To give a bit more insight, what we want to do is to compare E[/3i,s, s < a] with the analogous quantity 
in which pi and p2 run as independent Brownian motions. 

In the simplest case, {pi,p2) is \/2 times a planar Brownian motion started at (po, po) andE[/(pi,s, p2,s), s < 
a] is simply (p{s, pa, po), with tf being the solution to the following PDE on the upper-right quadrant 
^ = {{x,y),x,y > 0}, 

ip{t,{x,y))^0, {x,y)edn 
^{0,{x,y)) = f{x,y), {x,y)en. 

This solution can be written in terms of the heat kernel, which we discuss now. On the half line, the heat 
kernel for the Laplacian with the Dirichlet boundary condition is given by 

ht{x,y) = e * - e * 



for all y, t > 0. On fi, the heat kernel with the Dirichlet boundary condition is simply 

ht((a::i,X2), {yi,y2)) = ht{xi,yi)ht{yi,y2)- 
Turning back to the PDE (94), the solution is given by 

/•oo /"OO 

^it,x,y)= / ht{{x,y),{xi,yi))f{xi,yi)dxidyi. 
Jo Jo 

For the case we are most interested in, namely f{x, y) = x, the solution above can be computed as 

ip{s, x,y) ^ x^ (^) ^^^'^ ^ ^ I 

Now we go back to the system (90) and take (p{s — t, pi^^, P2^t) as a semimartingale which, from Ito's 
formula and dtf— | A(/3, becomes 

dip{s - T,/5i,r,P2,r) =dxf dpi^r + dyip dp2,T - dfipdr + ^dl^ipd{pi)r + dlyipd{pi,p2)T + ^dyy(pd{p2)T 

s - r Vs -T \y/s - T J 

where Mr is a martingale. Since $' and y^"{y) are bounded, we deduce that the drift in the above is 
bounded in absolute value by ^LilliiJ^S^ Now replacing rhy t Air A( and evaluating at r = and r = s, 
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we are led to 



Pl,T + P2,T 



dT 



Po 



c 



E[Pl,r +P2,T,T < 0-AC] 



dr. 



\/ S — T 

Denote for simplicity /(s) = E[/5i.s, s < ct A C] and g{s) — Cpa^ (^) ' Now condition (91) implies 
(95) f{s)<g{s)+C r ^kLdr. 

Jo VS-T 

This functional inequality is interesting enough to be treated separately, and so we do this formally in 
the following. 

Lemma 20. Assume /, g : [0, t] — > [0, oo) are bounded, continuous functions such that for all s G [0, 1 A 



(96) 



f{s)<g{s) + C 



\/s - T 



dr. 



If9{s) < Cp^/y/sfor all s e [0, 1 A t], then 



f{s) < Cp'^/^/sforall s e (0, 1 A 

Proof. Rewrite (96) in the form 

f{s) < gis) + CV~s 



■ r J^dr = g{s) + CV:s f 
Ja Vs-T Jo 



dw. 



Now introduce the random variable W with density ^^/jn^ ^^'^ observe that the right hand side of the 
above equation becomes g{s) + C y/sE[f (sW)]. Hence, the inequality at hand can be re-written as 

fis) <g{s) + C^E[f{sW)]. 

Iterating this inequality, one can prove that if we pick an iid sequence Wi , W2 , . . . with the same distribution 
as W, then for any n > 1, 

n 

f{s) <Y,iCV^)'E[Vw;^n^W2 ■ ■ . VWiW2...Wk-ig{sWiW2 . . . Wk)] 

fe=0 

+ {C^)"E[^/m^WiW2.. . ^yWiW2...Wnf{sWiW2 . . . Wn+l)]. 

The random variable W has moments 

Particularly important is the case of fc = —1/2, so that is integrable, and in fact E[l/\/W^] = 7r/2. It is 
an elementary task to obtain from this that, for some constant C > 0, 

E[W''] < C/Vk for all fc > 0. 

Since g is bounded, the series 

CO 

J2iCV^fE[^i^WiW2 . . . ^WiW2...Wk-ig{sWiW2 ...Wk)] 

k=0 

is absolutely convergent and {C ^y'E[^/W^^/^\W^ . . . s/W^jV^T:?Wnf{sWxW2 . . ■ VK„+i)] goes to as n ^ 
00. Consequently, 

00 

/(s) < ^{C4~sf^.^l^W^W2 . . . ^WiW2 . . . Wk-ig{sWiW2 . . . Wk)]. 

fe=0 
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If 9{s) < Cp'^/^/s, the above yields 

2 ^ 



k=0 



W[y'WiW2 . . . ^/WiW2 . . . Wk-1 

y/WiW2...Wk 



where we used the decay of the moments of W together with the fact that 1/ v is integrable to justify that 
the series is convergent. □ 

The rest of the proof of (92) follows now from Lemma 20. 

We now turn our attention to (93) and observe that, from (90), we easily deduce that 

dplP2 = Pldp2 + hdpi +d(/5i,p2>r 
= dMr + 0{pi + p2)dT 

with Mr a martingale. Using this at the times t ~ and t ^ s A a A C, with < s < 1 A t and uitegratuig, 
we get 



fo^P(C <sAa)< E[p,^,^,^^P2,sAa^c^ <p'o + C'- 



7E 



{Pl.r + P2,T)dT 



<pI+C I E[{pi^r + P2,r),T < a A C]d7 
(91)&(92) 



< pI + c 



' ~pI 



dT = Cpl 



which is what we needed. □ 

Now we go back to (89). We cannot use Theorem 19 to conclude that E[/oi s, s < cr A C] < Cpo/V^ because 
the equations satisfied by pi and p2 are not of the form (90). However, if we take pi,s = pi.sS^'^^'-^^''/ 
Pi,s = /oi,se^'"°*^^"\ then (45) and an application of Ito's formula (followed by several rearrangements) show 
that pi and p2 do satisfy (90). In addition. Corollary 17 combined with the fact that e^'-=*^^=' is bounded 
shows that (91) is also satisfied. Therefore, according to Theorem 19, E[/5i^s, s < a A < Cp^j ^fs and this 
in turn implies 

E[pi.s, s < A C] < C'/9o/\/s and / E[pi,„, u < cr A C]rfw < Cpq\/s. 

^0 

Using the preceding in (89), we write the resulting equation as 

Ht-u 



'Ct 



-d-u for any s e [0, 1 A i] . 



Now dividing both sides by Po ^^id then letting po tend to 0, we arrive at 



Ht < c- 



Ht~u 



du for any s G [0, 1 A t]. 



From here, the rest is taken care of by the following lemma. 

Lemma 21. If H : [0, oo) ^ [0, oo) fs a continuous function such that, for some constant C > 0, 



(97) 



Ht < c 



-ct 



du] , < s < 1 A i, 



then there are constants k,K > such that 



Ht<Ke~''^ forallt>0. 
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Proof. It suffices to concentrate on the case t > 1. 

Let m„ = suptg [„ „_,_ Ht and M„ = suptg[„_i „+i] Ht. Clearly, m„ < M„ and Af„ is either m„ or m„+i. 

Now, if we take the t which maximizes Ht on [n, n+1] and use (97), we get that for some constant C > 
and any s £ [0, 1], 

rrin < c{ — ^ + \/sAf^ 
V Vs 

We want to minimize the right hand side of the above expression over s G [0, 1]. For any a,b > 0, the 
minimum of a/^s + 6-^5 with s G [0, 1] is attained at | A 1. Hence 

m„ < c 

We split the analysis according to the following cases: 

(1) Case: e"^"/^ < M„. This leads first to e"^"/Af„ < e-'^"/^ < 1, and then to 

This is enough to conclude that we can find a large rii such that for all rt > ni one gets m„ < M„/2, 
which means that we cannot have Af„ = riin unless irin — nin-i — 0. Hence Af„ = m„_i, which in 
turn implies that for some fc > 

(*) m„ < e~'^m„_i if n > ni. 

(2) Case: A4n < e-'^"/^. This yields 

Notice that we can arrange the constant fc > to be the same in (*) and (**) simply by taking the 
smaller. 

By combining (*) and (**), we can show that m„ decays exponentially fast. Indeed, if there is n2 > ni for 
which the second alternative holds, then m„2 < e^*^"^ . Then an easy induction and use of both alternatives 
yields that m„ < e^'^" for all n > n2- On the other hand, if there is no such n2, that means the second 
alternative holds, and this means that m„ < TOn-ie^*^ for all n > This then results in m„ < rn„je~'^^"~"i' 
and thus in the exponential decay. □ 

This completes the proof of Theorem 18. □ 

11. C'' CONVERGENCE OF p ON SURFACES WITH x(A/) < 

In the previous two sections, using the same notation and assumptions, we proved there exists a constant 
C > such that 

(98) sup \ptix)\ + sup \\7pt{x)\ + sup |Hesspt(x)| < ce"*^* for all t > 0. 

xeM xeM xeM 

Alternatively stated, p converges to exponentially fast in the C^-norm. In particular, this proves that the 
metric gt converges to the constant curvature metric h in the -topology, and thus the curvature of gt 
converges uniformly to a constant. 

We now complete our discussion of the convergence to the constant curvature metric by extending this 
to C°° -convergence. The culmination of the last several sections is the following theorem. 

Theorem 22. Let M be a smooth, compact surface with xiM) < 0, with a reference metric h of constant curvature 
or — 1, and let go be a smooth initial metric in the same conformal class as h and with the same area. Then if we let 
ptfor t e [0, oo) be the associated solution to the normalized Ricciflow (as given in equation (9)), we have that 

pt ^ in C°°, exponentially fast, 



STOCHASTIC APPROACH TO RICCI FLOW 45 

in the sense that this convergence takes place exponentially fast in the C'' -norm for all positive integers k. Stated 
differently, if gt for t e [0, oo) is the family of solution metrics to the normalized Ricci flow (and so the metrics 
corresponding to pt), then gt ^ h in C°°, exponentially fast. 

Proof. We start with the equation 

dtp = e-^P'Apt + r(l - e-^P'). 

Now we can assume, by induction, that all derivatives of pt of order I with < I < k — 1 decay to 
exponentially fast as t goes to infinity. In light of the -convergence, we may assume that fc > 3. 

Taking the fcth derivative pj*^' = V^'^'pt, after commuting the Laplacian with the covariant derivative we 
obtain 

(99) dtp'i^^ = e^2p,^-(fc) _^ 2re-2?^*pf') + q('=) 

where Q'' depends on the lower order derivatives of pt, and thus we may assume by induction that for 

fc > 2, 

(100) IQi^^l < ce-^*. 

The idea now is to write a Feynman-Kac formula for the solution to (99) and get the estimates from this. 
Indeed, notice that if is the time changed Brownian motion starting at x which is defined by (12), then 

(101) ^^P(2^/ e'P^-^^-Uu^ %pi%{x„) - """"^{^^l e-P*--^^-Uv^ TuQi'l'uMdu 

is a martingale, where Tu is the extension to tensors of the parallel transport along the path q] from Xu 
to Xq = X. From the technical side this expression can be seen in a clear way by lifting the equation (99) 

to the orthonormal frame bundle, where the lift of p^*^' takes values in a tensor product space of a fixed 
2-dimensional Euclidean space. This is standard in stochastic analysis and we do not belabor it. 
One result of equation (101) is that evaluation at cr = and cr = t gives 

(102) 



p't''\x)=E cxp(^2r^ e-P'-'-'=-^du^%pi''\xa) -E ^ cxp (^2r^ e^P'-^^'^-Uv^ TuQ^'^JuMd^ 

Notice the first consequence of this, namely that IPt*^"*] is bounded for r < (which is the case under 
consideration). We consider separately the cases r = — 1 and r = 0. 

Case: r = —1. From the exponential decay of pt and the induction hjrpothesis (the decay of Qj'"^) it is 
easy to see that 

\p[''\x)\ < ce-^'* foralH > 0, 

and thus the induction is done. 

Case: r — 0. For the flat case, we still learn from (102) that pl'^^ (x) is uniformly boimded in t and x. Since 
the curvature of the underlying metric /i is we know (cf. [16, Theorem 8.1]) that the holonomy groups are 
trivial (perhaps after lifting to the orientation cover). Stated differently, the parallel transport along loops is 
the identity. 

To finish the argument, we are going to use the coupling technique we already used for the gradient 
estimates. Start with a fixed point x E M and a unit vector ^, and write 

.1n^^ -k( \c Y7 -(fe-i) r ThP?^^\l{h)) - pf~'^\x) 
(103) Pt (x)i = V^pI = hm , 

where here Th is the parallel transport from to along the geodesic 7 started at x with initial velocity 

t 

Now we use the martingale representation (101) with k replaced by (fc — 1) to see that, for x and y close 
enough and T the parallel transport from Ty to along the minimizing geodesic. 



Tpf-'\y)-pf''\x) = E TT„pTL-"{v.) - %pt7'M 



{TTuQtJHvu) ~ ruQt'J\x^)) du 
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Take t > 1 and let <t be 1 A r with r the coupling time of and ?/„. Now, because the holonomy group is 
trivial, it follows that 



E 







From this and the exponential decay of p['' and Qf' ^\ we have 

\Tp'i'"~^\y) - pi''"^\x)\ < e-^*P(l < r) +e-" / V{u < T)du 
Finally, using the estimate (44), we get 

|rp-r'Hy)-p-M(x)| <e-^*d(x,y) + e-^' ^^du ^ Ce-^'d{x,y) 

Jo Vu 

Now taking y ~ l{h) and considering the limit as h goes to leads to 

bf)(x)C|<ce-" 
for any unit vector ^, which implies the exponential convergence 
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